Ayadi Tahar | Main

Random Forest Regression using Pyspark

Publish Date: 2022-10-28

Random Forest Regression using Pyspark

Regression is a statistical method used in many areas like finance, investing, healthcare and many others which try to identify the strength of the relationship between one dependent variable (usually denoted by Y) and a series of other variables (known as independent variables or fea...

read more

Spark Structured Streaming with CSV files

Publish Date: 2022-10-21

Spark Structured Streaming with CSV files

Structured Streaming is a scalable and fault-tolerant stream processing engine built on the Spark SQL engine. You can express your streaming computation the same way you would express a batch computation on static data. The Spark SQL engine will take care of running it incrementally and...

read more

The principles of Heuristic Function and GPS coordinates

Publish Date: 2022-10-20

The principles of Heuristic Function and GPS coordinates

In sphere we have no strait lines, so the distance between 2 points on the surface of sphere measured by geodesics which is circles whose centers coincide with the center of the sphere, and they are called great circles.

Between antipodal points (directly opposite each other ), the...

read more

Implement A* Shortest Path using Neo4j

Publish Date: 2022-10-17

Implement A* Shortest Path using Neo4j

The A* (pronounced "A-Star") Shortest Path algorithm computes the shortest path between two nodes. A* is an informed search algorithm as it uses a heuristic function to guide the graph traversal. The algorithm supports weighted graphs with positive relationship weights.

Today ...

read more

[draft] What is a Data Lake ?

Publish Date: 2022-10-17

What is a Data Lake ?

A data lake is a central environment or repository where you can hold and store all your data in its native format, until needed for analytic applications - from dashboards and visualizations to big data processing, real-time analytics, and machine learning, in order to guide for better decisions...

read more