Join us in New York City for the second annual DataGalaxy Tech Summit! To celebrate the event’s second edition, DataGalaxy is organizing an evening of tech networking in New York City.
This in-person Summit brings together industry experts to share their insights on optimizing data models, choosing the best data storage formats, and insights on streamlining data ingestion processes.
This year, we’re diving deep into data modeling and storage – Two of the most crucial aspects of data engineering.
We’ll be delving into advanced strategies for data modeling, the benefits of using modern storage formats such as Iceberg and Delta, and the efficiency of direct data ingestion with cutting-edge tools like Redpanda. Whether you’re a Data Engineer, Data Analyst, data leader, or a passionate data consumer, the DataGalaxy Tech Summit will provide you with cutting-edge knowledge and practical solutions to enhance your data infrastructure.
We look forward to seeing you there!
Program
Mixed Model Arts - The Convergence of Data Modeling Across Apps, Analytics, and AI
For decades, data modeling has been fragmented by use cases: applications, analytics, and machine learning/AI. This leads to data siloing and “throwing data over the wall.”
With the emergence of AI, streaming data, and “shifting left” are changing data modeling, these siloed approaches are insufficient for the diverse world of data use cases. Today’s practitioners must possess an end-to-end understanding of the myriad techniques for modeling data throughout the data lifecycle. This presentation covers “mixed model arts,” which advocates converging various data modeling methods and the innovations of new ones.
Duck DB: How to put it to work today?
How did we get to the point that a single machine can compete with distributed systems ? DuckDB opens up many creative possibilities for data engineers, putting a column store database engine in a tiny library you put anywhere. Where does DuckDB shine ?
What makes for a good pilot project in your organization ? Can you really replace your warehouse with DuckDB somehow ?
Hear the Latest on Next Gen Streaming, by Redpanda
Redpanda is a new Kafka known for its operationally simple, developer friendly approach. Recently, Redpanda has been making significant changes to its platform with capabilities such as native Topic -> Iceberg integration, flexible topic configuration for performance vs cost and recently acquired Benthos which get immediate upgrades with WASM and GPU / AI integrations.In this talk we’ll get an update from Redpanda on some of the details behind these new features and more, and will understand what else is on the roadmap. You will also have a chance to ask questions and get some Redpanda swag!
Data Modeling - Let’s learn from the past
In this talk, we’ll explore how modern data professionals are often unaware of the lessons from past data architecture mistakes—leading to inefficiencies and avoidable errors. We’ll journey through the evolution of data models, from the simplicity of the star schema to the complexity of the snowflake schema, and examine how today’s Medallion Architecture offers a fresh, more agile approach compared to traditional data warehousing methods.
By embracing tried-and-true principles while adopting new innovations, we can create data architectures that are not only efficient but scalable and adaptable to future challenges.