IGUAZIO BRINGS ITS DATA SCIENCE PLATFORM TO AZURE AND AZURE STACK

Iguazio, an end-to-end platform that allows data scientists to take machine learning models from data ingestion to training, testing and production, today announced that it is bringing its solution to Microsoft’s Azure cloud and Azure Stack on-premises platform.The 80-person company, which has received a total of $48 million in funding to date, aims to make it easier for data scientists to do the work they are actually paid to do. The company argues that a lot of the work that data scientists do today is about managing the infrastructure and handling integrations, not building the machine learning models.We see that machine learning pipelines are way more complex than people think,” Iguazio CEO Asaf Somekh told me. “People think this is good stuff, but it’s actually horrible. We’re trying to simplify that.”To do this, Iguazio is betting on open source. It uses standard tools and API to pull in data from a wide variety of sources, which is then stored in its real-time in-memory database, which can handle streaming data, as well as time series data, tables and files. It also uses standard Jupyter notebooks instead of some form of proprietary format, but what’s maybe most interesting is that the company also built and open-platform for building data science pipelines. To build the models, Iguazio also uses KubeFlow, a machine learning toolkit for the Kubernetes container platform.Given that Azure and Azure Stack are essentially the same platform, as far as the APIs are concerned, Iguazio can then take its software and run it both in the cloud and on premises. Soon, it’ll also bring its service to Microsoft’s Azure Data Box Edge, Microsoft’s hardware solution for storing and analyzing data at the edge, which can be equipped with FPGAs for deploying machine learning models.

Spotlight

Other News
Big Data

Airbyte Racks Up Awards from InfoWorld, BigDATAwire, Built In; Builds Largest and Fastest-Growing User Community

Airbyte | January 30, 2024

Airbyte, creators of the leading open-source data movement infrastructure, today announced a series of accomplishments and awards reinforcing its standing as the largest and fastest-growing data movement community. With a focus on innovation, community engagement, and performance enhancement, Airbyte continues to revolutionize the way data is handled and processed across industries. “Airbyte proudly stands as the front-runner in the data movement landscape with the largest community of more than 5,000 daily users and over 125,000 deployments, with monthly data synchronizations of over 2 petabytes,” said Michel Tricot, co-founder and CEO, Airbyte. “This unparalleled growth is a testament to Airbyte's widespread adoption by users and the trust placed in its capabilities.” The Airbyte community has more than 800 code contributors and 12,000 stars on GitHub. Recently, the company held its second annual virtual conference called move(data), which attracted over 5,000 attendees. Airbyte was named an InfoWorld Technology of the Year Award finalist: Data Management – Integration (in October) for cutting-edge products that are changing how IT organizations work and how companies do business. And, at the start of this year, was named to the Built In 2024 Best Places To Work Award in San Francisco – Best Startups to Work For, recognizing the company's commitment to fostering a positive work environment, remote and flexible work opportunities, and programs for diversity, equity, and inclusion. Today, the company received the BigDATAwire Readers/Editors Choice Award – Big Data and AI Startup, which recognizes companies and products that have made a difference. Other key milestones in 2023 include the following. Availability of more than 350 data connectors, making Airbyte the platform with the most connectors in the industry. The company aims to increase that to 500 high-quality connectors supported by the end of this year. More than 2,000 custom connectors were created with the Airbyte No-Code Connector Builder, which enables data connectors to be made in minutes. Significant performance improvement with database replication speed increased by 10 times to support larger datasets. Added support for five vector databases, in addition to unstructured data sources, as the first company to build a bridge between data movement platforms and artificial intelligence (AI). Looking ahead, Airbyte will introduce data lakehouse destinations, as well as a new Publish feature to push data to API destinations. About Airbyte Airbyte is the open-source data movement infrastructure leader running in the safety of your cloud and syncing data from applications, APIs, and databases to data warehouses, lakes, and other destinations. Airbyte offers four products: Airbyte Open Source, Airbyte Self-Managed, Airbyte Cloud, and Powered by Airbyte. Airbyte was co-founded by Michel Tricot (former director of engineering and head of integrations at Liveramp and RideOS) and John Lafleur (serial entrepreneur of dev tools and B2B). The company is headquartered in San Francisco with a distributed team around the world. To learn more, visit airbyte.com.

Read More