Cloudera Sets Agenda for Next Era Cloud Data Analytics

It may seem like the story of Hadoop hit a dramatic climax this year as big data workloads went to public cloud giants like AWS and one of the original purveyors of the open source big data technology stack, MapR, narrowly averted shutting down when HPE acquired it.But the real story is one of gradual evolution. Hadoop started out as a technology stack for managing big data, but in the years since the term "Hadoop" faded as the hot tech buzzword, it has become something more a movement toward a modern architecture for managing and analyzing data,said Arun Murthy, chief product officer at Cloudera, and former CPO and a co-founder for Hortonworks, in a post on Medium this week titled Hadoop is Dead. Long live Hadoop. Thats what Cloudera, one of the original three Hadoop distribution companies, and the only remaining one (Hortonworks and Cloudera announced their plans to merge a year ago), has envisioned as it has created its stack of technology aimed at enterprise customers. It's not just Hadoop anymore. The plan is for a collection of open source technologies made available in the cloud to enterprise customers. That has been an evolution. All three of the original Hadoop companies, Cloudera, Hortonworks, and MapR had been moving away from marketing themselves as Hadoop companies for several years already. The Strata + Hadoop Conference changed its name to the Strata Data Conference in 2017. Now a new vision is emerging of large-scale data platforms made up of open source components and based in the cloud.

Spotlight

Other News
Big Data

Airbyte Racks Up Awards from InfoWorld, BigDATAwire, Built In; Builds Largest and Fastest-Growing User Community

Airbyte | January 30, 2024

Airbyte, creators of the leading open-source data movement infrastructure, today announced a series of accomplishments and awards reinforcing its standing as the largest and fastest-growing data movement community. With a focus on innovation, community engagement, and performance enhancement, Airbyte continues to revolutionize the way data is handled and processed across industries. “Airbyte proudly stands as the front-runner in the data movement landscape with the largest community of more than 5,000 daily users and over 125,000 deployments, with monthly data synchronizations of over 2 petabytes,” said Michel Tricot, co-founder and CEO, Airbyte. “This unparalleled growth is a testament to Airbyte's widespread adoption by users and the trust placed in its capabilities.” The Airbyte community has more than 800 code contributors and 12,000 stars on GitHub. Recently, the company held its second annual virtual conference called move(data), which attracted over 5,000 attendees. Airbyte was named an InfoWorld Technology of the Year Award finalist: Data Management – Integration (in October) for cutting-edge products that are changing how IT organizations work and how companies do business. And, at the start of this year, was named to the Built In 2024 Best Places To Work Award in San Francisco – Best Startups to Work For, recognizing the company's commitment to fostering a positive work environment, remote and flexible work opportunities, and programs for diversity, equity, and inclusion. Today, the company received the BigDATAwire Readers/Editors Choice Award – Big Data and AI Startup, which recognizes companies and products that have made a difference. Other key milestones in 2023 include the following. Availability of more than 350 data connectors, making Airbyte the platform with the most connectors in the industry. The company aims to increase that to 500 high-quality connectors supported by the end of this year. More than 2,000 custom connectors were created with the Airbyte No-Code Connector Builder, which enables data connectors to be made in minutes. Significant performance improvement with database replication speed increased by 10 times to support larger datasets. Added support for five vector databases, in addition to unstructured data sources, as the first company to build a bridge between data movement platforms and artificial intelligence (AI). Looking ahead, Airbyte will introduce data lakehouse destinations, as well as a new Publish feature to push data to API destinations. About Airbyte Airbyte is the open-source data movement infrastructure leader running in the safety of your cloud and syncing data from applications, APIs, and databases to data warehouses, lakes, and other destinations. Airbyte offers four products: Airbyte Open Source, Airbyte Self-Managed, Airbyte Cloud, and Powered by Airbyte. Airbyte was co-founded by Michel Tricot (former director of engineering and head of integrations at Liveramp and RideOS) and John Lafleur (serial entrepreneur of dev tools and B2B). The company is headquartered in San Francisco with a distributed team around the world. To learn more, visit airbyte.com.

Read More