31 May 2023


Data-Driven Decision-Making with DataGalaxy & Starburst

[Webinar recap]

DataGalaxy and Starburst, pioneers in collaborative data governance and in SQL analytics engines for data lakes and federations, respectively, recently presented a joint webinar showcasing the powerful new integration between their services. The webinar, which presents an innovative solution for managing and organizing data from multiple sources, features Laurent Dresse, DataGalaxy’s Chief Evangelist; Victor Coustenoble, Starburst Solutions Architect Manager; and Julien Laguilhomie, DataGalaxy Product Manager.

The integration of DataGalaxy and Starburst combines the two solutions, creating a synergistic approach to data management and governance. This collaboration enables organizations to leverage the unique capabilities of both platforms, maximizing data discovery, accessibility, and organization.

DataGalaxy and Starburst Features: Enriching Data Understanding

DataGalaxy’s data knowledge catalog allows users to easily search, discover, and explore their company’s data. This comprehensive solution spans the crucial steps of finding, understanding, trusting, and consuming data. Users can seamlessly embark on the data discovery process by leveraging DataGalaxy’s advanced features, such as the AI-assisted business semantic layer, knowledge graph navigation, and the user-friendly Chrome extension.

But the value proposition extends beyond discovery. With DataGalaxy’s integration of enriched Starburst data products and the powerful Trino SQL engine, users can dive deeper into understanding and trusting the data. The Data Knowledge Studio empowers users with intuitive business and tech diagramming capabilities, automated lineage tracking, collaborative features, and detailed definitions. This ensures a comprehensive understanding of the data, its sources, and its relationships.

Building trust in the data is vital, and DataGalaxy provides a suite of features to instill confidence. Data and metadata quality indicators, roles and responsibilities for data ownership, and usage metrics contribute to data governance and trustworthiness. This foundation allows users to make informed decisions based on reliable, accurate, and trusted data.

Seamless Data Consumption

When consuming the data, the integration with Starburst and Trino SQL offers significant advantages. Users can leverage a common language (SQL) and a common security layer, ensuring seamless authentication, access policies, and data masking. The versatility and interoperability of the solution allow easy integration with various tools and platforms, providing flexibility and accessibility. Additionally, features like sampling speed up data retrieval, supporting quick data-driven decision-making.

DataGalaxy’s integration with Starburst and the powerful Trino SQL engine provides an end-to-end solution, accelerating the journey of data consumers from a business question to a data-driven decision. With advanced capabilities for finding, understanding, and trusting data, coupled with seamless data consumption features, organizations can effectively manage their data as a valuable and usable asset, promoting its usage and delivering actionable insights.

Streamlined Federated Data Management

Starburst’s query federation capabilities enable users to simultaneously access and combine data from multiple sources, such as Snowflake, Oracle, and AWS S3 data lakes. “The idea of Starburst is to join and perform data federation between different sources,” stated Victor Coustenoble, Starburst Solutions Architect Manager. Data products created through Starburst, along with their metadata, are referenced by DataGalaxy, enriched by DataGalaxy’s business knowledge layer, and then quickly served to clients.

This federated data management approach allows users to create comprehensive data products by incorporating information from various systems, improving overall data quality, and providing a more holistic view of their data landscape.
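As a rough, self-contained analogy for the query federation described above (this is not Starburst or Trino itself), the sketch below uses SQLite's ATTACH DATABASE so that a single SQL statement joins tables living in two separate databases. The database names `crm` and `billing`, the tables, and the sample rows are all hypothetical stand-ins for real source systems. In Trino, tables are addressed in a similar spirit as `catalog.schema.table`.

```python
import sqlite3


def build_federated_demo() -> sqlite3.Connection:
    """Open one connection with two separately attached in-memory databases."""
    conn = sqlite3.connect(":memory:")
    # Each ATTACH creates an independent database, standing in for a
    # separate source system (names are hypothetical).
    conn.execute("ATTACH DATABASE ':memory:' AS crm")
    conn.execute("ATTACH DATABASE ':memory:' AS billing")
    conn.execute("CREATE TABLE crm.customers (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute("CREATE TABLE billing.orders (customer_id INTEGER, amount REAL)")
    conn.executemany(
        "INSERT INTO crm.customers VALUES (?, ?)",
        [(1, "Acme"), (2, "Globex")],
    )
    conn.executemany(
        "INSERT INTO billing.orders VALUES (?, ?)",
        [(1, 120.0), (1, 80.0), (2, 50.0)],
    )
    return conn


def revenue_by_customer(conn: sqlite3.Connection) -> list:
    """Join across both attached databases in a single query."""
    query = """
        SELECT c.name, SUM(o.amount) AS revenue
        FROM crm.customers AS c
        JOIN billing.orders AS o ON o.customer_id = c.id
        GROUP BY c.name
        ORDER BY c.name
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    print(revenue_by_customer(build_federated_demo()))
```

The point of the sketch is only the shape of the query: one statement, several sources, each referenced by a qualified name. A real deployment would configure connectors for each source rather than attaching local databases.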

Integrating DataGalaxy and Starburst

The integration between DataGalaxy and Starburst enables users to access and use data products from numerous popular business intelligence tools, such as Tableau Software. This seamless tool integration streamlines data-driven decision-making processes by allowing users to quickly visualize and analyze data products within their preferred analytics platforms. “The idea is to quickly find and understand your data from a business point of view and make it easily usable within your BI tool,” remarked Victor.

With the integration of DataGalaxy and Starburst, users can easily enrich Starburst data products within the DataGalaxy catalog. This streamlined process ensures that data products remain accurate, up-to-date, and aligned with the organization’s evolving data needs, fostering a culture of continuous data improvement and optimization.

“One way to be data-driven at scale is to think about the data that you have not as just an asset but more like an internal product that people own and bring to life,” explained Julien Laguilhomie, DataGalaxy Product Manager.

The partnership between DataGalaxy and Starburst also fosters a collaborative approach to data governance. By leveraging DataGalaxy’s data knowledge workplace features, organizations can establish consistent governance policies and procedures, ensuring data products are secure, well-documented, and approved for business use. This collaborative data governance framework allows users to confidently share and manage data products.

Key Takeaways from the DataGalaxy and Starburst Webinar

During the webinar, the presenters illustrated the integration between DataGalaxy and Starburst, demonstrating how these powerful solutions optimize data management, governance, and accessibility. The demonstration used a realistic use case and followed a step-by-step approach, enabling attendees to understand the capabilities offered by this unique partnership.

#1: Empowering Users to Search, Discover, and Explore Data Products

Beginning with a walkthrough of DataGalaxy’s user interface, Julien showed attendees how to search, discover, and explore data products within the platform. Highlighting the platform’s powerful search functionality, intuitive navigation, and detailed metadata display, Julien emphasized how these features facilitate efficient data discovery and exploration.

“The most important factor is how you share information within your organization. You can spend a lot of time on integration,” noted Laurent, “but if you don’t share it with your people, what’s the point?”

#2: Boosting Data Accessibility and Leveraging Data Federation for Enhanced Dataset Creation

Next, Victor demonstrated how to create, access, and use data products directly within Starburst’s UI. He showcased the seamless integration between the two platforms, highlighting how users can query and analyze data products across multiple sources using Starburst’s powerful query federation capabilities. The benefits of Starburst’s data product features, such as improved data visibility, consistent governance, and ultimate accessibility, were also emphasized.

Victor demonstrated how users could create datasets by leveraging data federation between various sources, including Snowflake, Oracle, and AWS S3 data lakes. He then showed how Starburst combined data from multiple systems to create a unified, comprehensive data product. This example illustrated the versatility of Starburst’s query federation capabilities and the value of integrating diverse data sources.

#3: Transforming Data Visualization with DataGalaxy and Starburst Integration

The presenters then demonstrated how users could use Starburst data products within popular business intelligence tools like Tableau Software. They connected Tableau to Starburst and showed how to import data products, create insightful visualizations, and generate interactive reports and dashboards to support data-driven decision-making processes.

#4: Streamlined Starburst Updates for Accurate Data

Finally, the presenters showed how users could request an update of a Starburst data product within the DataGalaxy catalog. Victor illustrated modifying data product information, editing metadata, and synchronizing changes between DataGalaxy and Starburst. This section emphasized the importance of maintaining accurate, up-to-date data products and fostering a culture of continuous data improvement and optimization.

Throughout the demonstration, the presenters provided practical tips and best practices, enabling attendees to understand the powerful capabilities offered by integrating DataGalaxy and Starburst. By showcasing real-world examples and use cases, the webinar effectively illustrated how organizations could leverage this unique partnership to optimize their data management, governance, and accessibility.

Embracing the Future of Data Management

Integrating DataGalaxy and Starburst marks a significant advancement in data management, governance, and accessibility. As demonstrated during the webinar, this unique partnership offers organizations a comprehensive and robust solution to optimize their data-driven decision-making processes and unlock the full potential of their data assets.

By combining Starburst’s query federation capabilities with DataGalaxy’s domain-driven data cataloging, users can efficiently search, discover, and use data products from various sources at once. This collaboration streamlines data discovery and access and fosters a collaborative approach to data governance. Combining each platform’s core strengths enables organizations to establish consistent policies and procedures while promoting a culture of data transparency and trust.

Moreover, the integration supports popular business intelligence tools, allowing users to visualize and analyze data products within their preferred analytics platforms. This flexibility empowers organizations to make informed decisions based on accurate, relevant, and timely information, ultimately driving better business outcomes and competitive advantage.

The DataGalaxy and Starburst partnership marks a significant step forward for organizations seeking to harness the power of their data assets. By leveraging the insights and best practices shared during the webinar, organizations can effectively implement these technologies and capitalize on the benefits of this groundbreaking collaboration.

Embracing this integrated approach to data management and governance will enable organizations to navigate the complexities of the modern data landscape and thrive in an increasingly data-driven world.

DataGalaxy and Starburst: Pioneers in Data Management

Starburst, founded in 2017, is a leading provider of SQL analytics engines for data lakes and federations. With more than 600 employees, including the creators of Presto, Starburst supports a variety of data strategies, such as Data Mesh and Data Lakehouse, through query federation, embedded analytics engines, and more. The company focuses on providing data products, streamlining visibility, ensuring consistent governance, and offering ultimate accessibility.

Founded in 2015 in Lyon, DataGalaxy is the pioneer of collaborative data governance in France. It offers the industry’s first data knowledge catalog, helping organizations understand how their business runs on data. An established leader in Europe, growing rapidly and operating worldwide, DataGalaxy offers a user-centric platform dedicated to metadata mapping, active metadata management, and metadata knowledge sharing. With its innovative approach to data governance and cataloging, DataGalaxy helps businesses of all sizes gain control over their data assets and make better, more informed decisions.

The DataGalaxy and Starburst webinar provided an in-depth look at integrating their solutions, emphasizing the value this partnership brings to organizations seeking to optimize their data management, governance, and accessibility. As more companies recognize the importance of leveraging data as a strategic asset, the collaboration between DataGalaxy and Starburst presents a compelling solution to navigate the complexities of the modern data landscape.

By embracing the insights and best practices shared during the webinar, organizations can take advantage of this groundbreaking collaboration and unlock the full potential of their data assets, driving better business outcomes and achieving a competitive advantage.

Watch the webinar replay here.
