Big Data Management, Business Strategy

Striim and Databricks Partner to Bring Real-Time Data Integration to the Databricks Lakehouse

Striim | September 21, 2022 | Read time : 02:50 min

Striim
Striim, a global leader in unified real-time data integration and streaming, today announced at the Big Data LDN conference and expo that Striim has joined the Databricks Technology Partner Program. Databricks Technology Partners integrate their solutions with Databricks to provide complementary capabilities for ETL, data ingestion, business intelligence, machine learning, and governance. Striim’s integration with Databricks enables enterprises to leverage the Databricks Lakehouse Platform’s reliability and scalability to innovate faster while deriving valuable data insights in real-time via Striim’s real-time streaming capabilities.

“It’s clear that the businesses that can make accurate, data-driven decisions more quickly have a clear advantage over their competitors. “We intentionally partner with technology providers like Striim to enable our customers to speed their time-to-insight via Databricks AI/ML solutions. We are excited to partner with Striim, providing our customers access to a fully-managed cloud service to seamlessly connect data from databases, applications, and disparate clouds to Databricks in real-time.”

Ariel Amster, director of strategic technology partners at Databricks

The Databricks Lakehouse Platform combines the best elements of data lakes and data warehouses, enabling users to unify their data, analytics, and AI, build on open-source technology, and maintain a consistent platform across clouds. Striim enables Databricks’s AI/ML to create new models that leverage real-time data resulting in far more accurate business predictions. This, in turn, means better decisions more quickly, giving businesses and significant competitive edge.

“In today's digital economy, customer experience, data movement, and data governance require real-time streaming data. Legacy batch processes simply are not enough to meet the demands of today’s AI/ML applications,” said Philip Cockrell, Striim’s senior vice president of Business Development. “Striim’s software-as-a-service offering delivers ‘best-in-class’ capabilities for real-time data integration, helping Databricks customers more fully realize the proven AI/ML functionality Databricks delivers.”

Striim Cloud delivers these capabilities for the enterprise in a managed service format that eliminates the complexity of building low-latency streaming data pipelines at scale. Instead of spending weeks implementing this new infrastructure, global enterprises can now integrate data from disparate sources in just a few clicks.

About Striim
Striim, Inc. is the only supplier of unified, real-time data streaming and integration for analytics and operations in the Digital Economy. Striim Platform and Striim Cloud make it easy to continuously ingest, process, and deliver high volumes of real-time data from diverse sources (both on-premises or in the cloud) to support multi- and hybrid cloud infrastructure. Striim collects data in real time from enterprise databases (using non-intrusive change data capture), log files, messaging systems, and sensors, and delivers it to virtually any target on-premises or in the cloud with sub-second latency enabling real-time operations and analytics.

Spotlight

Integrating analytics into UX work helps to make data-based decisions and focus efforts on projects with the most significant impact.


Other News
Big Data Management

Kinetica Redefines Real-Time Analytics with Native LLM Integration

Kinetica | September 22, 2023

Kinetica, a renowned speed layer for generative AI and real-time analytics, has recently unveiled a native Large Language Model (LLM) integrated with Kinetica's innovative architecture. This empowers users to perform ad-hoc data analysis on real-time, structured data with the ease of natural language, all without the need for external API calls and without data ever leaving the secure confines of the customer's environment. This significant milestone follows Kinetica's prior innovation as the first analytic database to integrate with OpenAI. Amid the LLM fervor, enterprises and government agencies are actively seeking inventive ways to automate various business functions while safeguarding sensitive information that could be exposed through fine-tuning or prompt augmentation. Public LLMs, exemplified by OpenAI's GPT 3.5, raise valid concerns regarding privacy and security. These concerns are effectively mitigated through native offerings, seamlessly integrated into the Kinetica deployment, and securely nestled within the customer's network perimeter. Beyond its superior security features, Kinetica's native LLM is finely tuned to the syntax and industry-specific data definitions, spanning domains such as telecommunications, automotive, financial services, logistics, and more. This tailored approach ensures the generation of more reliable and precise SQL queries. Notably, this capability extends beyond conventional SQL, enabling efficient handling of intricate tasks essential for enhanced decision-making capabilities, particularly for time-series, graph, and spatial inquiries. Kinetica's approach to fine-tuning places emphasis on optimizing SQL generation to deliver consistent and accurate results, in stark contrast to more conventional methods that prioritize creativity but yield diverse and unpredictable responses. This steadfast commitment to reliable SQL query outcomes offers businesses and users the peace of mind they deserve. Illustrating the practical impact of this innovation, the US Air Force has been collaborating closely with Kinetica to leverage advanced analytics on sensor data, enabling swift identification and response to potential threats. This partnership contributes significantly to the safety and security of the national airspace system. The US Air Force now employs Kinetica's embedded LLM to detect airspace threats and anomalies using natural language. Kinetica's database excels in converting natural language queries into SQL, delivering responses in mere seconds, even when faced with complex or unfamiliar questions. Furthermore, Kinetica seamlessly combines various analytics modes, including time series, spatial, graph, and machine learning, thereby expanding the range of queries it can effectively address. What truly enables Kinetica to excel in conversational query processing is its ingenious use of native vectorization. In a vectorized query engine, data is organized into fixed-size blocks called vectors, enabling parallel query operations on these vectors. This stands in contrast to traditional approaches that process individual data elements sequentially. The result is significantly accelerated query execution, all within a smaller compute footprint. This remarkable speed is made possible by the utilization of GPUs and the latest CPU advancements, which enable simultaneous calculations on multiple data elements, thereby greatly enhancing the processing speed of computation-intensive tasks across multiple cores or threads. About Kinetica Kinetica is a pioneering company at the forefront of real-time analytics and is the creator of the groundbreaking real-time analytical database specially designed for sensor and machine data. The company offers native vectorized analytics capabilities in the fields of generative AI, spatial analysis, time-series modeling, and graph processing. A distinguished array of the world's largest enterprises spanning diverse sectors, including the public sector, financial services, telecommunications, energy, healthcare, retail, and automotive industries, entrusts Kinetica to forge novel solutions in the realms of time-series data and spatial analysis. The company's clientele includes various illustrious organizations such as the US Air Force, Citibank, Ford, T-Mobile, and numerous others.

Read More

Data Visualization

Airbyte Integrates with Datadog for Superior Data Pipeline Management

Airbyte | September 11, 2023

Airbyte, the leading open-source data movement platform, has announced a strategic integration with Datadog, Inc., a prominent cloud application monitoring and security platform. This integration offers customers a comprehensive solution to monitor and analyze data pipelines with access to nearly 50 metrics, all at no additional cost. The integration between Airbyte Self-Managed and Datadog's data observability and security monitoring capabilities allows organizations to maintain a close watch on the health of their critical data pipelines. Key features of this integration include: A centralized overview of Airbyte data pipeline performance Real-time detection and immediate alerts for failing syncs or connections Notifications regarding long-running jobs, which could indicate potential latency issues Michel Tricot, CEO of Airbyte, emphasized the significance of this integration, stating, The new Datadog integration provides transparency and actionable insights, empowering users to optimize performance and ensure reliable data pipelines by proactively addressing potential data issues. [Source: Business Wire] Yrieix Garnier, Vice President of Product at Datadog, further elaborated on the benefits, explaining, Airbyte's data extraction and loading process involves numerous complex components. The integration with Datadog offers users peace of mind, enabling them to monitor data pipelines across their organization and troubleshoot any potential data integration workflow issues, ultimately ensuring data quality. [Source: Business Wire] This integration will be immediately available to users. Existing Datadog customers can configure their Airbyte deployments to send metrics to Datadog. For those not already using Datadog, a free trial is available. Similarly, users new to Airbyte can sign up for free. Airbyte continues its commitment to delivering robust data integration and analysis solutions to organizations. The Datadog integration represents a significant milestone in Airbyte's mission to empower businesses with efficient data integration capabilities. Airbyte simplifies data movement across various sources and destinations, making it accessible and cost-effective for enterprises. With the largest data engineering contributor community, boasting over 800 contributors as well as top-tier tools for connector development and maintenance, Airbyte remains at the forefront of the data integration landscape. About Airbyte Founded in 2020, Airbyte is an open-source platform for EL (T) that enables data teams to replicate data from various sources to different destinations. The company, which has raised $181 million in funding, believes in the power of open source to address data integration challenges and offers over 200 connectors for data syncing. Currently, it serves over 25,000 companies.

Read More

Business Intelligence

Dremio Launches Next-gen Reflections Redefining SQL Query Acceleration

Dremio | September 15, 2023

Dremio, a renowned easy and open data lakehouse solution provider, has recently introduced its next-gen Reflections technology, marking a transformative milestone in SQL query acceleration. Dremio Reflections facilitate sub-second analytics performance across an organization's entire data ecosystem, irrespective of data location. This groundbreaking technology is redefining data access and analysis, ensuring that valuable insights are derived efficiently and swiftly, all while reducing costs to merely one-third of a typical cloud data warehouse. Reflections represent Dremio's innovative SQL query acceleration technology. Queries that leverage Reflections exhibit performance gains ranging from 10 to 100 times faster than their non-accelerated counterparts. This latest release introduces the Dremio Reflection Recommender, a pioneering feature that empowers users to accelerate Business Intelligence workloads in a matter of seconds. The Reflection Recommender automatically evaluates an organization's SQL queries and generates recommended Reflections to accelerate them. Tomer Shiran, founder of Dremio, commented, Dremio Reflections accelerate SQL queries by orders of magnitude, eliminating the need for BI extracts/imports and enabling companies to run their most mission-critical BI workloads directly on a lakehouse. With automatic recommendations and next-generation incremental updates, we've made it even easier for organizations to take advantage of this innovative technology. [Source: Business Wire] Reflection Recommender eliminates the need for labor-intensive manual data and workload analysis, making the process of obtaining the fastest and most intelligent query results effortless, requiring only a few simple actions. The user-friendly nature of the Reflection Recommender puts advanced query acceleration capabilities within the reach of all users, significantly saving both time and expenses. Dremio has also refined the process of refreshing Reflections to further bolster query performance and drive cost efficiencies. It now intelligently refreshes Reflections on Apache Iceberg tables, promptly capturing incremental data changes. This innovative approach obviates the requirement for complete data refreshes, resulting in speedier updates and reduced compute expenses. Dremio Reflections eliminates the need for data teams to export data from the data lakehouse into BI extracts or imports for analytical reasons and overcomes performance bottlenecks for BI dashboards and reports. In addition, Reflections negate the necessity of creating precomputed tables within the data lake or data warehouse to achieve sub-second performance for BI workloads, reducing the workload and complexity for data teams. About Dremio Dremio is a leading, easy and open data lakehouse solution provider, offering organizations the versatility of self-service analytics coupled with the functionality of a data warehouse and the flexibility of a data lake. Dremio's platform empowers users to harness the lightning-fast SQL query service alongside various processing engines, all on the same dataset. The company distinguishes itself through a pioneering data-as-code methodology akin to Git, which facilitates data experimentation, version control, and governance. This innovative approach enhances agility and empowers organizations to explore and manage their data resources with unprecedented efficiency. Furthermore, Dremio offers a fully managed service that expedites organizations' entry into analytics, allowing them to commence their data-driven journey within minutes.

Read More

Big Data Management

iMerit Unveils Innovative Solution for Radiology AI Data Challenges

iMerit | September 07, 2023

iMerit, a prominent player in the field of artificial intelligence (AI) data solutions, has unveiled its latest offering, the Radiology Annotation Product Suite. This innovative suite is designed to cater to the needs of medical AI developers by providing advanced automation, annotation, and analytics capabilities. This new product suite is firmly rooted in iMerit's Ango Hub platform, an end-to-end enterprise-grade technology platform that is specifically tailored to deliver top-notch data annotation tools for AI development teams. Within this suite, a comprehensive range of solutions awaits, including data sourcing, workflow design, cutting-edge data annotation tools, and the invaluable input of human experts, all seamlessly integrated into a single platform. This unique fusion of iMerit's technological prowess and radiology expertise ensures a smooth journey from training data all the way to regulatory benchmarking. A significant hurdle in radiology AI applications is the demand for specialized tools and insights from domain experts to ensure the necessary accuracy and precision in training data. For developers, the Radiology Annotation Product Suite offers a one-stop solution, combining automation, annotation tools, and analytics within a single platform, facilitating the creation of precise data pipelines essential for quick scaling radiology AI solutions into production. Sina Bari MD, Senior Director of Medical AI at iMerit, explained that their suite aims to combine human expertise, data management, and automation in a single solution. Notable features include Customized workflows designed for Radiologists with consensus and multistep capabilities. Diagnostic-level annotation accuracy with multiplanar functionality and 3D volume rendering. Efficient annotation facilitated by smart automation tools and model integration. Stringent data security and regulatory compliance, including CFR 21 Part 11, HIPAA, and SOC2. This solution seamlessly integrates U.S. and offshore experts, including board-certified radiologists, all within a highly secure end-to-end platform, ensuring cost-effective scaling of annotation efforts. Radha Basu, Founder and CEO of iMerit, emphasized the importance of their work, stating that they understand the challenges of developing AI applications and that their suite was built to scale data annotation efforts with high quality, accuracy, and speed. About iMerit iMerit is a prominent AI data solutions company specializing in data services such as dataset creation, image tagging, sentiment analysis, data verification, and content aggregation. It serves Fortune 500 enterprises in various industries and have a global presence, with headquarters in the United States and teams in India, the US, Bhutan, and Europe. iMerit's investors include Omidyar Network, Dell.org, Khosla Ventures, and British International Investment.

Read More