Big Data Management

dotData Integrates Automated Feature Engineering with Databricks Platform

A leader in full-cycle enterprise AI automation solutions, dotData, announced today the integration of its award-winning Automated Feature Engineering (AutoFE) technology with Databricks Platform and is available on the same. With this integration of dotData’s AutoFE and Databricks Platform, Databricks users can now explore 100x more features and boost model accuracy quickly, augmenting domain features with hundreds of AI features.

dotData’s AutoFE is completely integrated with Python data science workflow and explores millions of features from relational, transactional, geo-locational, and text data. It deals with multi-relational tables with billions of records and builds an ML-ready feature table just in hours.

Advantages of dotData’s AutoFE and Databricks platform integration include:

• All features and functionality of dotData's award-winning automated feature engineering and AutoML
• Leverages Databricks runtime to maximize the speed of feature engineering
• Compatibility with Databricks tool ecosystem, e.g. manage dotData's AI features by Databricks' Feature Store
• Installed as a library, requiring no changes on the existing Databricks Python workbench

dotData automates feature engineering, the most manual and time-consuming step in AI and ML projects. The hidden patterns behind hundreds of tables with complex relationships and billions of rows and AI features for your AI and ML algorithms can be easily discovered with dotData’s exclusive AI technology. Feature engineering, until now, has 100 per cent relied on the intuition and experience of domain experts and data scientists. dotData enables users to leverage AI to discover unknown unknowns and build greater AI and ML models.

Data science teams with experience can leverage dotData’s AI features to enhance in-house developed features. AutoFE helps users with rapid and automated methods to rapidly prototype use cases, explore new datasets to find significant patterns, and improve the accuracy of AI and ML models. It is available as a Python library seamlessly integrated with your existing Python workflow and cuts 80 per cent of the time to develop features for your AI and ML models. 

About dotData
dotData pioneered Automated Feature Engineering to accelerate and augment the process of building AI/ML models, to drive higher business value for the enterprise. dotData ingests raw business data and uses an AI-based engine to automatically discover meaningful insights and build ML-ready feature tables from relational, transactional, temporal, geo-locational, and text data. dotData's scalable, the flexible platform enables data scientists to discover and evaluate outstanding AI features; and empowers business intelligence professionals to Add AI/ML models to their BI stacks and predictive analytics applications quickly and easily. Fortune 500 organizations around the world use dotData to accelerate their ML and AI development to drive higher business value.

Spotlight

Spotlight

Related News

Big Data

Airbyte Racks Up Awards from InfoWorld, BigDATAwire, Built In; Builds Largest and Fastest-Growing User Community

Airbyte | January 30, 2024

Airbyte, creators of the leading open-source data movement infrastructure, today announced a series of accomplishments and awards reinforcing its standing as the largest and fastest-growing data movement community. With a focus on innovation, community engagement, and performance enhancement, Airbyte continues to revolutionize the way data is handled and processed across industries. “Airbyte proudly stands as the front-runner in the data movement landscape with the largest community of more than 5,000 daily users and over 125,000 deployments, with monthly data synchronizations of over 2 petabytes,” said Michel Tricot, co-founder and CEO, Airbyte. “This unparalleled growth is a testament to Airbyte's widespread adoption by users and the trust placed in its capabilities.” The Airbyte community has more than 800 code contributors and 12,000 stars on GitHub. Recently, the company held its second annual virtual conference called move(data), which attracted over 5,000 attendees. Airbyte was named an InfoWorld Technology of the Year Award finalist: Data Management – Integration (in October) for cutting-edge products that are changing how IT organizations work and how companies do business. And, at the start of this year, was named to the Built In 2024 Best Places To Work Award in San Francisco – Best Startups to Work For, recognizing the company's commitment to fostering a positive work environment, remote and flexible work opportunities, and programs for diversity, equity, and inclusion. Today, the company received the BigDATAwire Readers/Editors Choice Award – Big Data and AI Startup, which recognizes companies and products that have made a difference. Other key milestones in 2023 include the following. Availability of more than 350 data connectors, making Airbyte the platform with the most connectors in the industry. The company aims to increase that to 500 high-quality connectors supported by the end of this year. More than 2,000 custom connectors were created with the Airbyte No-Code Connector Builder, which enables data connectors to be made in minutes. Significant performance improvement with database replication speed increased by 10 times to support larger datasets. Added support for five vector databases, in addition to unstructured data sources, as the first company to build a bridge between data movement platforms and artificial intelligence (AI). Looking ahead, Airbyte will introduce data lakehouse destinations, as well as a new Publish feature to push data to API destinations. About Airbyte Airbyte is the open-source data movement infrastructure leader running in the safety of your cloud and syncing data from applications, APIs, and databases to data warehouses, lakes, and other destinations. Airbyte offers four products: Airbyte Open Source, Airbyte Self-Managed, Airbyte Cloud, and Powered by Airbyte. Airbyte was co-founded by Michel Tricot (former director of engineering and head of integrations at Liveramp and RideOS) and John Lafleur (serial entrepreneur of dev tools and B2B). The company is headquartered in San Francisco with a distributed team around the world. To learn more, visit airbyte.com.

Read More