In this video, our consultant Andreia Negreira breaks down the concept of parallel processing using a real-world example. She then provides a brief history of Apache Hadoop, highlighting its limitations and how Apache Spark emerged as a solution.
Furthermore, Andreia explores the evolution of data platforms, tracing the journey from Data Warehouses to Data Lakes and ultimately to Data Lakehouses. To conclude, she offers a brief introduction to the Databricks data platform, showcasing its role in modern data processing and analytics.
Watch now to gain valuable insights into the ever-evolving world of data!
In June 2023 Andreia joined the AI class at BeCode, where she found the perfect environment for a hands-on training, enhancing her problem-solving skills cultivated during her studies in Environmental Engineering and her master's in chemical and Biochemical Process Engineering. With experience in data integration, streaming, ETL design, and end-to-end data science pipelines, she is able to bring a fresh perspective and a constant passion for innovation to the tech world, fueled by her analytical and problem-solving skills developed throughout her career.
At Big Industries, we want to share our wisdom. More info on our recent developments, newest projects and upcoming events
You are cordially invited to the AWS re:Invent 2024 re:Cap, taking place on January 21st at Salons...
Databricks Community Edition is a free, cloud-based version of the Databricks platform tailored...
Stay informed about our recent developments, newest projects and upcoming events