Engineering uptime

blog by recode community.

Explore articles

Find the right guide

Design

Land a Job in UI/UX Design

Are you passionate about design and dreaming of a career in it? Or maybe you are already in the design space and looking to pivot into UI/UX?

Sowmiya Venkatesan,Sanjay Viswanthan

5 min read

Read Article →

Development

What is GitHub Copilot

The GitHub Copilot Coding Agent is an asynchronous software engineering agent that assists developers by suggesting code snippets

Sanjay Viswanthan

5 min read

Read Article →

Development

Apache Spark Architecture Explained

Apache Spark is a fast, open-source big data framework that leverages in-memory computing for high performance. Its architecture powers scalable distributed processing across clusters, making it essential for analytics and machine learning.

Aditya Singh Rathore,Sanjay Viswanthan

5 min read

Read Article →

Development

N8N: The Future of Workflow Automation

N8N is an open-source workflow automation tool that enables users to connect various apps and services to automate tasks without extensive coding knowledge.

Aditya Singh Rathore

5 min read

Read Article →

AI & Tech

OpenAI AgentKit: Building AI Agents Without the Complexity

OpenAI AgentKit is a framework that simplifies the process of building AI agents, allowing developers to create intelligent applications without getting bogged down in the underlying complexities.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Delta Lake: An Introduction to Trustworthy Data Storage

Delta Lake is an open-source storage layer that brings ACID transactions to Apache Spark and big data workloads.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

How I Cleared the Azure Data Engineer Associate Certification

The Microsoft Certified: Azure Data Engineer Associate certification validates your skills in designing and implementing data solutions on the Azure platform.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Microsoft Fabric: One Platform, One Lake, Every Data Workload

Microsoft Fabric is a unified analytics platform that integrates various data services and tools to provide a seamless experience for data professionals, enabling them to manage and analyze data across the entire data lifecycle.

Aditya Singh Rathore

5 min read

Read Article →

security

How SSO Actually Works

SSO lets you log into dozens of apps with a single set of credentials. But how does it actually work under the hood? A beginner-friendly walkthrough of the full flow — from clicking 'Sign in with Google' to getting access — step by step.

Aditya Singh Rathore,Sanjay Viswanthan

5 min read

Read Article →

data engineering

Lakehouse vs Data Warehouse: A Comprehensive Comparison

Lakehouse and Data Warehouse are two different data storage architectures. A Data Warehouse is a centralized repository for structured data, optimized for reporting and analysis. A Lakehouse combines the best of both worlds, allowing for the storage of both structured and unstructured data, providing flexibility and scalability.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

How Netflix Handles 2 Trillion Events Every Day

Netflix processes an enormous amount of data every day, handling over 2 trillion events. This article explores how they manage this massive scale and the technologies they use to ensure smooth operations.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Azure Storage & ADLS Gen2: Where Does Your Data Actually Live?

Azure Storage and Azure Data Lake Storage Gen2 (ADLS Gen2) are two different storage solutions offered by Microsoft Azure. Azure Storage is a general-purpose storage service that provides various types of storage, including blobs, files, queues, and tables. ADLS Gen2, on the other hand, is a specialized storage solution designed for big data analytics workloads, offering features like hierarchical namespace and optimized performance for analytics.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Azure Data Factory Pipeline: Build Your First ETL in 10 Minutes

Azure Data Factory Pipeline is a cloud-based data integration service that allows you to create data-driven workflows for orchestrating and automating data movement and transformation tasks. This article will guide you through the process of building your first ETL pipeline in Azure Data Factory.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Medallion Architecture: How to Stop Your Data Pipeline from Becoming a Nightmare

The Medallion Architecture is a data management approach that organizes data into different layers (Bronze, Silver, Gold) to improve data quality, governance, and scalability in data pipelines. It helps prevent data pipelines from becoming unmanageable by providing a structured framework for data processing and storage.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Azure Synapse Analytics: When to Use It (And When to Choose Fabric Instead)

Azure Synapse Analytics is a unified analytics service that combines big data and data warehousing capabilities. This article explores when to use Azure Synapse Analytics and when to choose Fabric instead.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Why We Rolled Back Our Kafka Pipeline to Batch After 6 Months

Streaming pipelines are powerful for real-time data processing, but they come with hidden costs that are often overlooked. These costs include increased complexity, higher resource consumption, and potential challenges in maintaining data consistency and reliability. This article explores these hidden costs and provides insights on how to mitigate them.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

Azure Data Pipeline Cost Optimization: How We Cut a $4,200 Bill by 73%

Azure Data Pipeline can be a powerful tool for data processing and analytics, but it can also lead to unexpectedly high costs if not managed properly. In this article, we share our experience of optimizing our Azure Data Pipeline costs, which resulted in a 73% reduction in our monthly bill, saving us $3,066. We discuss the strategies we implemented to achieve this significant cost reduction while maintaining the performance and reliability of our data pipeline.

Aditya Singh Rathore

5 min read

Read Article →

data engineering

PySpark Optimization Techniques: 6 Mistakes That Slow Down Every Beginner's Pipeline

PySpark is a powerful tool for big data processing, but it can be challenging to optimize for performance. In this article, we discuss six common mistakes that beginners make when optimizing their PySpark pipelines, which can lead to slow performance and increased costs. We provide practical tips and techniques to help you avoid these pitfalls and improve the efficiency of your PySpark applications.

Aditya Singh Rathore

5 min read

Read Article →