Business Intelligence, Big Data Management, Data Science

data.world Announces Data Governance and Data Catalog Integrations with Snowflake’s Snowpark

data.world Announces Data Governance

data.world, the data catalog platform, today announced at Snowflake’s annual user conference, Snowflake Summit 2023, an integration with Snowpark, the developer framework for Snowflake, the Data Cloud company. Through the integration, data.world provides data catalog, data governance, and DataOps applications that help Snowflake customers accelerate Snowpark migrations and govern Snowpark metadata. The integration follows data.world’s recent achievements of Powered by Snowflake, Data Governance Accelerated, and Snowflake Premier Partner status.

“For our customers to be able to deploy data-intensive applications in the language of their choice – whether that’s Java, Python, or Scala – they first need to be able to find, understand, and trust the underlying data,” said Tarik Dweik, Head of Technology Alliances at Snowflake. “The integration with data.world will improve productivity for data engineers and provide answers around what functions are available, what jobs have been run and by whom, and when was this data last refreshed.”

“Businesses can achieve significant cost savings and fuel innovation by migrating data assets to Snowflake,” said Lofan Leung, Head of Technology Alliances at data.world. “Our integration empowers joint customers to seamlessly manage and gain visibility into their data landscape, with enterprise-grade governance that enables organizations to embrace the cloud confidently.”

Locating information is often challenging for data teams. They need metadata on functions, the lineage of those functions, and stored procedures to create a common understanding of data-intensive applications. data.world catalogs these Snowpark functions, enabling developers, data engineers, data scientists, and data product owners to get deeper access to the resources they care about and deliver enterprise-wide benefits, such as:

  • Improved productivity: Tags for Snowpark functions based on metadata enable data teams to quickly find the relevant data and applications they need to support specific analytical use cases and business objectives.
  • Increased efficiency: A clear understanding and visibility of what is available and for what purpose encourages data teams to share and reuse existing assets, saving development time and costs.
  • Trust across the team: Trusted data and applications enable better decision making for downstream users with more visibility and improved understanding of how these objects are related to each other and support the business objectives.
  • Faster and more reliable migration of data applications to the cloud: Teams are able to track statuses to manage migration workflow, create prioritized lists with high-value assets for higher return on investment, analyze data lineage for Snowpark UDFs to identify dependencies objects to lower migration risks, and document migration decisions in a centralized catalog platform.

data.world’s unique knowledge graph architecture enables users to model and map key business concepts and relationships to any data asset. data.world establishes relationships between tables, views, functions, stored procedures, tasks, workflow, jobs, data applications with people, and business processes in a more holistic way. As a result, data teams with different backgrounds gain better insights faster across different domains to understand exactly how the data assets are related and being consumed by the business.

Be sure to check out the Snowflake Summit 2023 keynotes live or on-demand here and stay on top of the latest news and announcements from Snowflake on LinkedIn and Twitter.

To learn more about the Snowpark integration and the data.world Data Catalog Platform, visit data.world at Snowflake Summit, booth 1241.

About data.world

data.world is the data catalog platform. Its cloud-native SaaS (software-as-a-service) platform combines a consumer-grade user experience with a powerful knowledge graph to deliver enhanced data discovery, agile data governance, and actionable insights. data.world is a Certified B Corporation and public benefit corporation and home to the world’s largest collaborative open data community with more than two million members, including ninety percent of the Fortune 500. Our company has sixty-two patents and has been named one of Austin’s Best Places to Work eight years in a row.

Spotlight

Spotlight

Related News

Big Data

Provider Density Data from LexisNexis Risk Solutions Shows Inequality of Provider Availability Across Regions

PR Newswire | October 06, 2023

LexisNexis® Risk Solutions, a leading provider of data and analytics, released new insights on the latest national and regional provider density trends for primary and specialty care. The analysis explores how often prescriber data changes, the metropolitan areas seeing the biggest change in the number of primary care providers (PCPs) and the metropolitan areas with the highest and lowest number of heart disease patients per cardiologist. Outflows of providers and coverage ratios can impact a community's ability to deliver accessible and efficient care, and with a looming shortfall of PCPs[1], it's important to understand where the existing PCPs are located. The analysis reveals the five metropolitan areas with the highest percent increase and decrease of PCPs between June 2022 and June 2023. According to the data, the Vallejo-Fairfield, CA area topped the list with a nearly 40% increase in PCPs. Conversely, the Fayetteville, NC area saw the highest decrease – losing nearly 12% of its PCPs. As chronic diseases continue to increase, the density of specialty providers becomes paramount. The provider density analysis examines the number of patients with heart disease per cardiologist in metropolitan statistical areas (MSAs) spanning large, medium, small, and micropolitan areas. The data shows as MSAs get smaller, the number of patients per cardiologist increases substantially, with many rural communities having thousands of heart disease patients per cardiologist. Among major metropolitan areas, Boston has the best ratio with 196 heart disease patients per cardiologist, and Las Vegas has the worst ratio with 824 heart disease patients per cardiologist. Additionally, the analysis found significant degradation of prescriber data in a short period of time. Over a quarter of prescribers (26%) had at least one change in their contact or license information within a 90-day period. This finding is based on the primary location of more than 2 million prescribers and illustrates the potential for data inaccuracies, creating an additional challenge for patients navigating the healthcare ecosystem. "Data is an essential element to fueling healthcare's success, but the continuously changing nature of provider data, when left unchecked, poses a threat to care coordination, patient experience, and health outcomes," said Jonathan Shannon, associate vice president of healthcare strategy, LexisNexis Risk Solutions. "Our recent analysis emphasizes the criticality of ensuring provider information is clean and accurate in real-time. With consistently updated provider data, healthcare organizations can develop meaningful strategies to improve provider availability, equitable access, and patient experience, particularly for vulnerable populations."

Read More

Big Data Management

SAS Introduces SAS Health Transforming Healthcare Data Management

SAS | September 15, 2023

SAS has launched SAS Health, an end-to-end enterprise solution focused on healthcare analytics and data automation. SAS Health is powered by a common health data model with predefined mappings to industry standards. SAS' introduction of SAS Health is part of its $1 billion commitment to invest in AI-powered industry solutions over the next three years. SAS, a globally renowned leader in AI and analytics, has recently unveiled SAS Health, an innovative end-to-end enterprise solution designed for analytics and data automation in the healthcare sector. This innovative platform streamlines health data management, enhances data governance and expedites the generation of valuable patient insights. Within the healthcare industry, the cumbersome process of consolidating data from various systems and formats has been a significant impediment in the development and deployment of scalable healthcare analytic solutions that can benefit both individuals and communities. The patient insights generated through these analytics, ranging from the proactive identification of gaps in clinical staffing to the visualization of screening center distribution relative to the patient population, enable healthcare systems to gauge the quality of each patient interaction and make positive contributions to the care of individuals with complex chronic conditions. In pursuit of a solution to the challenge of providing healthcare providers and payers with centralized, secure, and analytics-optimized data, SAS Health is powered by a common health data model with predefined mappings to widely recognized industry standards. With just a few secure connection details entered, customers can rapidly embark on addressing the most critical aspects of enhancing patient care. Leveraging the capabilities of the analytics and AI platform SAS Viya, SAS Health facilitates the swift extraction of actionable insights, all while ensuring adherence to industry standards and regulations. Gail Stephens, VP of Health Care and Life Sciences at SAS, commented, "Having one consistent, common data model built on a powerful advanced analytics platform is pivotal for hospital systems and the future of health care delivery. SAS Health offers an extraordinary opportunity to advance patient care and treatment through improved efficiencies in data and analytics frameworks, which ultimately will allow health care payers and providers to deliver better outcomes, more quickly." [Source: Cision PR Newswire] SAS Health's common health data model on SingleStore will serve as a central hub for integrating diverse health data with financial, clinical, and operational information, offering an efficient and adaptable approach that reduces costs and simplifies data accessibility. The cloud-native solution will streamline the ingestion of data from multiple industry standards, commencing with the Fast Healthcare Interoperability Resources (FHIR), all in a no-code/low-code format. The global adoption of the FHIR industry data standard, which delineates how healthcare information can be exchanged among various computer systems, continues to grow. Prominent electronic health record (EHR) companies are swiftly embracing FHIR, and in the United States, the Centers for Medicare & Medicaid Services (CMS) have mandated its use. The introduction of SAS Health is one of the outcomes of SAS' recent commitment to invest $1 billion in AI-powered industry solutions over the next three years. This investment, announced in May 2023, builds upon SAS' decades-long dedication to providing tailored solutions for various industries, including government, banking, insurance, retail, manufacturing, healthcare, energy, telecommunications, media, and more, to address their unique challenges effectively.

Read More

Big Data Management

AVEVA Extends Data Capabilities from Edge to Plant to Community with AVEVA PI Data Infrastructure

iTWire | October 30, 2023

AVEVA, is a global leader in industrial software, driving digital transformation and sustainability, has launched AVEVA PI Data Infrastructure, a fully-integrated hybrid data solution providing easy scalability, centralised management, and the ability to share data collaboratively via the cloud. AVEVA PI Data Infrastructure is the latest offering in the market leading AVEVA PI System portfolio, which helps companies collect, enrich, analyse and visualise operations data to achieve deeper insight and operational excellence. Moving to hybrid infrastructure gives industrial companies the flexibility, scalability and security needed to deliver valuable, high-fidelity data to authorised users and applications in any location. The initial release also gives customers the option to use the OpenID Connect protocol for user authentication, enabling enterprise-wide single sign on. Other enterprise-class data management features will be delivered over several releases. AVEVA PI Data Infrastructure makes it easier for companies to collect and use real-time operations data in industrial environments that increasingly include sensor-enabled legacy systems, remote assets and IIoT devices. The hybrid architecture gives data access to more decision makers who rely on operations data to resolve problems and develop business insights, thereby reducing the total cost and effort of operations data management. By achieving seamless data sharing with any trusted collaborator, companies can overcome costly data silos, modernise and streamline user access and aggregate real-time and historical data for wider use and consumption. AVEVA PI Data Infrastructure is available via subscription using AVEVA Flex credits. Harpreet Gulati, SVP - Head of PI System Business at AVEVA, said, No other industrial software company offers a fully-integrated, seamless data infrastructure that enables the fast, secure flow of real-time, high-fidelity data to anywhere it is needed – across multiple plants, at the edge, or in a trusted community over the cloud – with complete data integrity. We want to provide our customers with the flexibility to deploy across any of these areas, enabling them to increase sustainability, operating efficiency, asset reliability, and organisational agility. Customers are embracing the new offering. Giovanna Ruggieri, Head of ICT at Italy’s EP Produzione, a subsidiary of the European energy giant, EPH, commented: "EP Produzione is actively pursuing digital transformation to maximise operational excellence and improve processes to support the business. To continue the journey, and better embrace the digital transformation, we need greater flexibility and integration at all levels, a data infrastructure that can give us full visibility across our multi-site operating environment that always keeps cyber security as high priority. "We appreciate AVEVA PI Data Infrastructure’s aggregate tag subscription model because it allows us to better manage our current and future needs in a smart way, with AVEVA currently proposing, for us, one of the best solutions on the market."

Read More