Big Data Management

Cloudian Partners with WEKA to Deliver High-Performance, Exabyte-Scalable Storage for AI, Machine Learning and Other Advanced Analytics

Cloudian® today announced the integration of its HyperStore® object storage with the WEKA Data Platform for AI, providing high-performance, exabyte-scalable private cloud storage for processing iterative analytical workloads. The combined solution unifies and simplifies the data pipeline for performance-intensive workloads and accelerated DataOps, all easily managed under a single namespace. In addition, the new solution reduces the storage TCO associated with data analytics by a third, compared to traditional storage systems.

Advanced Analytics Workloads Create Data Storage Challenges
Organizations are consuming and creating more data than ever before, and many are applying AI, machine learning (ML) and other advanced analytics on these large data sets to make better decisions in real-time and unlock new revenue streams. These analytics workloads create and use massive data sets that pose significant storage challenges, most importantly the ability to manage the data growth and enable users to extract timely insights from that data. Traditional storage systems simply can’t handle the processing needs or the scalability required for iterative analytics workloads and introduce bottlenecks to productivity and data-driven decision making.

Cloudian-WEKA Next Generation Storage Platform
Together, Cloudian and WEKA enable organizations to overcome the challenges of accelerating and scaling their data pipelines while lowering data analytics storage costs. WEKA’s data platform, built on WekaFS, addresses the storage challenges posed by today’s enterprise AI workloads and other high-performance applications running on-premises, in the cloud or bursting between platforms. The joint solution offers the simplicity of NAS, the performance of SAN or DAS and the scale of object storage, along with accelerating every stage of the data pipeline from data ingestion to cleansing to modeled results.

Integrated through WEKA’s tiering function, Cloudian’s enterprise-grade, software-defined object storage provides the following key benefits:

  • High Performance – Run concurrent workloads while eliminating compute cluster bottlenecks and reducing processing times.
  • Exabyte Scalability – Grow deployments on demand, from terabytes to an exabyte without disruption, achieving the flexibility and elasticity of the public cloud within a private data center or hybrid cloud model.
  • Enterprise-grade Security – Protect data with encryption in flight and at rest, integrated firewall, RBAC/IAM and SAML access controls, and certification with the most rigorous regulatory requirements, such as Common Criteria, FIPS and SEC Rule 17a-4(f).
  • Resiliency – Achieve high data durability with the option to protect and distribute data using replication or erasure coding, thereby eliminating the need for a separate data backup process.
  • Multi-tenancy – Provision multiple users on shared infrastructure without compromising security.
  • Cost-effective – Save on storage costs, as the solution runs on standard x86 hardware with local NVMe SSDs.
“As organizations increasingly employ AI, ML and other advanced analytics to extract greater value from their data, they need a modern storage platform that enables fast, easy data processing and management,” said Jonathan Martin, president, WEKA. “The combination of the WEKA Data Platform and Cloudian object storage provides an ideal solution that can seamlessly and cost-effectively scale to meet growing demands.”

“When it comes to supporting advanced analytics applications, users shouldn’t have to make tradeoffs between storage performance and capacity,” said Jon Toor, chief marketing officer, Cloudian. “By eliminating any need to compromise, the integration of our HyperStore software with the WEKA Data Platform gives customers a storage foundation that enables them to fully leverage these applications so they can gain new insights from their data and drive greater business and operational success.”

About Cloudian
Cloudian is the most widely deployed independent provider of object storage. With a native S3 API, it brings the scalability and flexibility of public cloud storage into the data center while providing ransomware protection and reducing TCO by up to 70% compared to traditional SAN/NAS and public cloud. The geo-distributed architecture enables users to manage and protect object and file data across sites—on-premises and in the cloud—from a single platform.

Spotlight

Spotlight

Related News

Big Data

Airbyte Racks Up Awards from InfoWorld, BigDATAwire, Built In; Builds Largest and Fastest-Growing User Community

Airbyte | January 30, 2024

Airbyte, creators of the leading open-source data movement infrastructure, today announced a series of accomplishments and awards reinforcing its standing as the largest and fastest-growing data movement community. With a focus on innovation, community engagement, and performance enhancement, Airbyte continues to revolutionize the way data is handled and processed across industries. “Airbyte proudly stands as the front-runner in the data movement landscape with the largest community of more than 5,000 daily users and over 125,000 deployments, with monthly data synchronizations of over 2 petabytes,” said Michel Tricot, co-founder and CEO, Airbyte. “This unparalleled growth is a testament to Airbyte's widespread adoption by users and the trust placed in its capabilities.” The Airbyte community has more than 800 code contributors and 12,000 stars on GitHub. Recently, the company held its second annual virtual conference called move(data), which attracted over 5,000 attendees. Airbyte was named an InfoWorld Technology of the Year Award finalist: Data Management – Integration (in October) for cutting-edge products that are changing how IT organizations work and how companies do business. And, at the start of this year, was named to the Built In 2024 Best Places To Work Award in San Francisco – Best Startups to Work For, recognizing the company's commitment to fostering a positive work environment, remote and flexible work opportunities, and programs for diversity, equity, and inclusion. Today, the company received the BigDATAwire Readers/Editors Choice Award – Big Data and AI Startup, which recognizes companies and products that have made a difference. Other key milestones in 2023 include the following. Availability of more than 350 data connectors, making Airbyte the platform with the most connectors in the industry. The company aims to increase that to 500 high-quality connectors supported by the end of this year. More than 2,000 custom connectors were created with the Airbyte No-Code Connector Builder, which enables data connectors to be made in minutes. Significant performance improvement with database replication speed increased by 10 times to support larger datasets. Added support for five vector databases, in addition to unstructured data sources, as the first company to build a bridge between data movement platforms and artificial intelligence (AI). Looking ahead, Airbyte will introduce data lakehouse destinations, as well as a new Publish feature to push data to API destinations. About Airbyte Airbyte is the open-source data movement infrastructure leader running in the safety of your cloud and syncing data from applications, APIs, and databases to data warehouses, lakes, and other destinations. Airbyte offers four products: Airbyte Open Source, Airbyte Self-Managed, Airbyte Cloud, and Powered by Airbyte. Airbyte was co-founded by Michel Tricot (former director of engineering and head of integrations at Liveramp and RideOS) and John Lafleur (serial entrepreneur of dev tools and B2B). The company is headquartered in San Francisco with a distributed team around the world. To learn more, visit airbyte.com.

Read More