Spark Shines brighter with Project Tungsten
What is the purpose of Project Tungsten?
May 22, 2022
How Apache Druid becomes a game changer in Big Data Pipelines?
About Apache Druid
Jul 28, 2022
Apache Druid Architecture and core concepts in layman terms
Apache Druid is a leading database for modern analytics applications. The main features of a modern analytics application are,
Aug 2, 2022
What is EMRFS and what value it adds to the S3 file system?
EMRFS is not a separate file system; it’s an implementation of HDFS that adds strong consistency to S3, whereas S3 without EMRFS…
Apr 4, 2020
Snowflake Summit 2023: Dynamic Tables
Snowflake: Revolutionizing Data Storage, Processing, and Analytics in the Cloud
Jul 8, 2023
Data and AI Summit 2023: Delta Lake 3.0 UniForm, Unifying analytics and AI on your data
At the Data and AI Summit 2023, Databricks made an exciting announcement regarding the latest release of Delta Lake 3.0 to Linux…
Jul 2, 2023
Upgrade your Data Engineering Game with Deequ: A Framework for Flawless Analytics
In today’s era of exponential data growth and rapid data processing, ensuring data quality is paramount. High-quality data serves as the…
Jun 26, 2023
Elevate Your Data Processing: Real-Time Stream Analytics with Apache Flink’s Event Time
Introduction to Event Time Processing:
Jun 1, 2023
How Apache Druid differs from other Big Data Giants
For a better understanding of a tool or technology, it is important to compare and contrast it with other similar tools in the market. It gives us a…
Aug 1, 2022