Ayadi Tahar | Main

[draft] What is a Data Lake?

Publish Date: 2022-10-17

A data lake is a central repository where you can store all your data in its native format until it is needed for analytic applications, from dashboards and visualizations to big data processing, real-time analytics, and machine learning, to guide better decisions...

Realtime Analysis using Spark Streaming and Kafka

Publish Date: 2022-10-09

Spark Streaming is an extension of the core Spark API that enables scalable, high-throughput, fault-tolerant stream processing of live data streams. Data can be ingested from many sources like Kafka, Kinesis, or TCP sockets, and can be processed using complex algorithms expressed with...

Apache Spark and Cassandra: Best NoSQL Big Data combination

Publish Date: 2022-10-07

Apache Spark™ is considered one of the most powerful engines for executing data engineering, data science, and machine learning on single-node machines or clusters, over diverse data sources such as NoSQL databases.

Apache...

How to Use and Implement a Stack?

Publish Date: 2022-10-04

A stack is a linear data structure that holds a sequence of elements and follows the Last In, First Out (LIFO) principle.

It is commonly used as an abstract data type with two major operations, namely push and pop, which are carried out on the topmost el...
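As a rough illustration of the push and pop operations described above, here is a minimal sketch of a stack in Python. The class name `Stack` and the helper methods `peek` and `is_empty` are assumptions made for this example, not taken from the full article, and a plain Python list is used as the backing store.

```python
class Stack:
    """A minimal LIFO stack backed by a Python list (illustrative sketch)."""

    def __init__(self):
        # The end of the list is treated as the top of the stack.
        self._items = []

    def push(self, item):
        """Place an item on top of the stack."""
        self._items.append(item)

    def pop(self):
        """Remove and return the topmost item."""
        if not self._items:
            raise IndexError("pop from empty stack")
        return self._items.pop()

    def peek(self):
        """Return the topmost item without removing it."""
        if not self._items:
            raise IndexError("peek from empty stack")
        return self._items[-1]

    def is_empty(self):
        """Report whether the stack holds no items."""
        return not self._items

    def __len__(self):
        return len(self._items)
```

Pushing 1, then 2, then calling `pop()` returns 2 first, since the last element added is the first one removed. Appending to and popping from the end of a list keeps both operations cheap, which is why the list tail is used as the top here.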

Read/Write data from/to PostgreSQL tables using Spark

Publish Date: 2022-10-02

Apache Spark is a fast, general-purpose computing engine used for big data processing. It can process data from many sources and formats, including relational databases such as PostgreSQL.

In this article, we will show you the steps to read data from PostgreSQL t...
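One common way to connect Spark to PostgreSQL is through Spark's built-in JDBC data source; the sketch below assumes that approach, which may differ from the steps in the full article. The function names, host, database, table, and credentials are all hypothetical, and the PostgreSQL JDBC driver jar is assumed to be available on the Spark classpath.

```python
def postgres_jdbc_options(host, port, database, table, user, password):
    """Build the options Spark's JDBC source needs for PostgreSQL (hypothetical helper)."""
    return {
        "url": f"jdbc:postgresql://{host}:{port}/{database}",
        "dbtable": table,
        "user": user,
        "password": password,
        # Driver class shipped in the PostgreSQL JDBC jar.
        "driver": "org.postgresql.Driver",
    }


def read_table(spark, options):
    """Load a PostgreSQL table as a DataFrame using an existing SparkSession."""
    return spark.read.format("jdbc").options(**options).load()


def write_table(df, options, mode="append"):
    """Write a DataFrame to the PostgreSQL table named in options."""
    df.write.format("jdbc").options(**options).mode(mode).save()
```

With a running SparkSession named `spark`, a call such as `read_table(spark, postgres_jdbc_options("localhost", 5432, "shop", "public.orders", "spark", "secret"))` would return a DataFrame, provided the database is reachable. Keeping the connection options in one dictionary lets the same settings be reused for both reading and writing.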