Recent Posts

How to Fix OOM Errors in Spark

2 minute read

A detailed guide on understanding and resolving Out of Memory (OOM) errors in Apache Spark.

Query Engines

2 minute read

Discover the evolution of query engines like Hive, Presto, and Spark, and learn how they revolutionize data processing with scalability, speed, and versatili...

Creating AWS API Gateway Private Endpoints

5 minute read

Learn how AWS API Gateway Private Endpoints use AWS PrivateLink to securely expose APIs within a VPC, ensuring data stays off the public internet.

Poetry for Dependency Management

1 minute read

Explore how Poetry manages Python dependencies and ensures reproducible setups across systems, enhancing project collaboration

Auto-GPT is overhyped.

1 minute read

Auto-GPT utilizes OpenAI’s API to perform tasks autonomously but can be overhyped due to its repetitive nature and high cost.

A Framework for EDA

8 minute read

Guide on performing Exploratory Data Analysis (EDA) when you start your machine learning project