DATA SCIENCE

Saturn Cloud and Bodo.ai Partner to Bring Extreme Performance Python to Data Scientists

Saturn Cloud | June 02, 2022

Saturn Cloud, the data science and machine learning platform, and Bodo.ai, a parallel data compute platform providing extreme scale and speed for Python, have announced a partnership to take Python analytics performance to the next level for data science teams.

Data scientists develop multiple workflows across teams and rely on Saturn Cloud to provide a collaborative environment and computing resources. With this partnership, those teams now have seamless access to the Bodo platform, allowing them to scale prototypes to petabyte-scale, parallel production workloads without any tuning or re-coding.

Saturn Cloud's pre-built tools allow data science teams to collaborate and scale easily without locking users into patterns. Instead, the platform supports the workflow the user already has, while providing an environment where they don't need to rely on dev resources or manage compute environments. It prioritizes keeping data scientists self-sufficient while letting them collaborate and share work more efficiently.

Bodo offers a parallel compute platform that provides extreme scale and speed with the simplicity and flexibility of native Python. Unlike libraries and frameworks such as Spark, Bodo is a new type of compiler that offers automatic parallelism and high efficiency, scaling beyond 10,000 cores. Bodo can also be used natively with analytics packages such as pandas, NumPy, scikit-learn, and more.
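As a rough illustration of what "native Python" means here, the sketch below decorates an ordinary pandas function with Bodo's `bodo.jit` decorator. The function name, column names, and sample data are hypothetical, and the fallback decorator is added only so the sketch still runs as plain pandas where Bodo is not installed. It is not taken from either company's templates.

```python
import pandas as pd

try:
    # Assumption: the Bodo package is installed (for example, via a
    # pre-configured environment).
    import bodo
    jit = bodo.jit
except ImportError:
    # Fallback for illustration only: run the function as plain pandas.
    def jit(func):
        return func


@jit
def mean_fare_by_zone(trips):
    # Ordinary pandas code; with Bodo available, the decorator compiles
    # and parallelizes it instead of requiring a rewrite.
    return trips.groupby("zone", as_index=False)["fare"].mean()


if __name__ == "__main__":
    # Hypothetical sample data.
    trips = pd.DataFrame(
        {"zone": ["a", "b", "a", "b"], "fare": [10.0, 20.0, 30.0, 40.0]}
    )
    print(mean_fare_by_zone(trips))
```

The point of the design is that the function body stays unchanged pandas code, so the same prototype can be run with or without the decorator.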

The joint solution is available immediately, with Bodo.ai software running within Saturn Cloud resources. Saturn Cloud provides a pre-built template with Bodo already installed and configured. Users can then access Bodo.ai functionality within JupyterLab or via SSH from VS Code, PyCharm, or the terminal. On Saturn Cloud, users can get up to 4TB of RAM and 128 vCPUs to back Bodo's software.

Two examples are available to try right away: using Bodo to speed up feature engineering and model training, and using Bodo to speed up data manipulation and analysis.

"Our partnership is focused on providing massive speed and productivity improvements to data scientists struggling with large-scale analytics projects. Bodo's platform adds terabyte-scale processing with unheard-of infrastructure efficiencies for Saturn Cloud users."

Behzad Nasre, CEO, Bodo

"We not only want to provide a flexible workspace for data science teams, but enable greater Python scaling capabilities to increase productivity in projects that are more demanding. This joint offering with Bodo will give users an opportunity to take their work to the next level with automatic parallelization for better overall performance," says Sebastian Metti, one of the Saturn Cloud founders.

About Saturn Cloud
Saturn Cloud is a data science and machine learning platform flexible enough for any team. Collaborate in the cloud on analyses and model training, then deploy your code, all using the same patterns you're used to but at cloud scale.

About Bodo
Founded in 2019, Bodo.ai is an extreme-performance parallel compute platform for data analytics, scaling past 10,000 cores and petabytes of data with unprecedented efficiency and linear scaling. Leveraging automatic parallelization and the first inferential compiler, Bodo is helping Fortune 500 customers solve some of the world's largest data analysis problems in a fraction of the traditional time, complexity, and cost, all while retaining the simplicity and flexibility of native Python. Developers can deploy Bodo on any infrastructure, from a laptop to a public cloud.

Spotlight

Where business intelligence provides answers to known questions, big data discovery reveals unknown patterns, relationships and insights in any data. Leveraging the power of Hadoop, Datameer makes it easy for anyone to quickly discover insights in any data, right away.


Other News
BIG DATA MANAGEMENT

Nordisk Film Adopts Qlik Cloud Analytics for Operational Efficiencies and Reduced Costs

Qlik | August 10, 2022

Qlik® today announced that Nordisk Film, the Nordic region's leading creator and distributor of films, has adopted Qlik Cloud® Analytics to realize operational efficiencies, reduce costs associated with data preparation and analysis, and ultimately expand data-driven decision making throughout the organization.

Nordisk Film is known as an industry innovator at the forefront of adopting modern technology and solutions that foster new, improved ways of working. Over the last five years, Nordisk Film has been on a journey to deploy strategies that meet modern consumer demands while leveraging fact-based decisions.

Nordisk Film’s prior business intelligence systems required significant maintenance, service, and dedicated storage space and memory. This made the system difficult for users to work with and limited Nordisk’s ability to scale the use of data for decisions. Nordisk Film was also looking to increase collaboration and streamline its different computer systems and data warehouses into a single cloud platform. Nordisk Film chose to move to Qlik Cloud for importing, cleaning and analyzing data in order to make more informed business decisions while leveraging Qlik Sense® SaaS.

“Previously we spent a lot of time with maintenance and making sure our internal systems worked. We migrated our old local platform to a Qlik environment, saving us time and allowing us to take advantage of the latest technical developments and improve our data structure for a more mature approach to analytics.”

Mikkel Hecht Hansen, Head of BI at Nordisk Film

For Nordisk Film, it is important to have a cost-effective and scalable platform, while also being able to leverage modern analytics capabilities such as augmented analytics and mobile access.

“Qlik has given us a completely different dimension of new knowledge and opportunities through, among other things, Insight Advisor. Along with an easy and simple security login through Azure AD, Qlik gives us many insights and data-driven facts that help us make better decisions,” said Hansen.

Insight Advisor is the AI assistant built into Qlik Sense that generates advanced analytics and insights using natural language interaction for Nordisk analytics users. Another key innovation in Qlik’s platform that is bringing value to Nordisk is Collaborative Notes, which allows employees to comment or write longer reports directly in the analytics environment. Qlik’s ease of learning and applicability to many different business areas have also helped Nordisk Film expand analytics adoption across the business.

“Nordisk Film is a great example of an incredible brand that is leveraging Qlik Cloud to accelerate its transformation into a data-driven business,” said Francisco Mateo-Sidron, Senior Vice President EMEA for Qlik. “We look forward to helping Nordisk continue to expand its ability to leverage cloud analytics for impact throughout the entire organization.”

About Nordisk Film
Nordisk Film is a leading Nordic entertainment and experience company focused on storytelling across platforms. We produce, market and distribute film and series, operate a leading Nordic cinema chain, are behind global game studios and PlayStation in the Nordics, and deliver digital gift card solutions to the world. Nordisk Film is a part of the leading Nordic media group Egmont, together with Story House Egmont, TV 2 in Norway, Lindhardt og Ringhof and Cappelen Damm. Egmont is a foundation, and all profits are used to develop media, to help children and young people, and to support film talents. We bring stories to life.

About Qlik
Qlik’s vision is a data-literate world, where everyone can use data and analytics to improve decision-making and solve their most challenging problems. A private company, Qlik offers real-time data integration and analytics solutions, powered by Qlik Cloud®, to close the gaps between data, insights and action. By transforming data into Active Intelligence, businesses can drive better decisions, improve revenue and profitability, and optimize customer relationships. Qlik serves more than 38,000 active customers in over 100 countries.


BUSINESS STRATEGY

ThoughtSpot and Matillion Partner to Enable Rapid Delivery of Insights with Low-Code Data Integration and Live Analytics SpotApps

ThoughtSpot | June 15, 2022

ThoughtSpot, the Modern Analytics Cloud company, and Matillion, the leading enterprise cloud data integration platform, announced today their joint partnership to provide data model templates and Live Analytics to help data teams working in the cloud get up-to-date data insights in minutes. Organizations can leverage ThoughtSpot SpotApps, prebuilt solutions for specific use cases powered by Matillion’s data transformation platform, to accelerate time to value and give more users access to the data they need.

Companies of all sizes struggle to turn their dynamic, distributed, and diverse data into insights, and those insights into actions. The data pipelines built in the last decade fail to deliver the agility, flexibility, or intuitiveness required by modern businesses. Often, the go-to solution to migrate data into the cloud involves manually extracting and preparing the data, a code-intensive process that is complex and time-consuming. This is exacerbated by intensive, complex analytics platforms that require deep specialization to unearth insights from this data once it’s in a cloud platform. Data teams need to be able to rapidly ingest and transform data, and to empower their entire organization to analyze it, to meet business demands.

Together, ThoughtSpot and Matillion reduce the development time needed to build data pipelines and launch new analytics use cases, helping business users easily access data and enabling data engineers to focus on more complex data projects. Matillion’s low-code/no-code ELT templates via Shared Jobs solve this problem, offering users a cloud-native tool to extract, migrate, and transform data from any source, while ThoughtSpot makes it possible for anyone to analyze this data through search and AI.

With Matillion, ThoughtSpot users can move data directly into their data cloud platform and quickly prepare it for analytics. As soon as the data is available, ThoughtSpot’s guided SpotApp configuration process makes it easy for users to stand up new Live Analytics use cases in seconds. SpotApps make getting up and running with use case templates simple for the most common SaaS applications such as ServiceNow, HubSpot, Okta, Google Analytics, and more, with out-of-the-box ThoughtSpot Modeling Language (TML)-based worksheets, tables and Liveboards.

“Before building our modern data stack with Matillion, Snowflake, and ThoughtSpot, Sargento was spending excessive amounts of time in data preparation and manual reporting in Excel. We set out on a mission to find a best-of-breed cloud-first analytics solution to extract the most business value from the massive amounts of data we had in front of us,” explained Travis Lehn, Senior Manager of Data & Analytics at Sargento Foods, Inc. “After a rigorous selection process, we ultimately chose a stack that scales analytics across our business and provides self-service capabilities to our diverse user base.”

“ThoughtSpot’s experience layer capabilities empower anyone to use search and AI to uncover powerful business insights with Live Analytics and are the perfect pairing for the data extraction, transformation and readiness that Matillion provides in the ELT layer. Through this partnership, we are dramatically reducing the stress and headache of building data pipelines and launching new analytics use cases, accelerating time to value for our customers in their data consumption and modern data stack journeys.”

Kuntal Vahalia, SVP of Worldwide Channel and Alliances

“Today’s modern data teams are burdened with the maintenance, migration, and preparation of data to perform analytics. Matillion and ThoughtSpot alleviate that burden using low-code and no-code templates to accelerate time to insights,” said Ciaran Dynes, Chief Product Officer at Matillion. “Now data teams can focus on innovation to use data for advanced use cases to drive the business forward.”

About ThoughtSpot
ThoughtSpot is the Modern Analytics Cloud company. Our mission is to create a more fact-driven world with the easiest to use analytics platform. With ThoughtSpot, anyone can leverage natural language search and AI to find data insights and tap into the most cutting edge innovations the cloud data ecosystem has to offer. Companies can put the power of their modern data stack in the hands of every employee, extend the value of their data to partners and customers, and automate entire business processes. ThoughtSpot enables everyone within an organization to limitlessly engage with live data regardless of their cloud data platform, making it easy to achieve granular, actionable insights through Live Analytics. Customers can take advantage of ThoughtSpot’s web and mobile applications to improve decision making for every employee. With ThoughtSpot’s developer-friendly platform, ThoughtSpot Everywhere, customers can also bring the Modern Analytics Cloud to their products and services, engaging users and keeping them coming back for more. Organizations like Walmart, BT, T-Mobile, Snowflake, HubSpot, Exxon, Daimler, Medtronic, Hulu, Royal Bank of Canada, Nasdaq, OpenTable, Workato, and Nationwide Building Society rely on ThoughtSpot to transform how their employees and customers take advantage of data.

About Matillion
Matillion makes the world's data useful with an easy-to-use, cloud-native data integration and transformation platform. Optimized for modern enterprise data teams, only Matillion is built on native integrations to cloud data platforms such as Snowflake, Delta Lake on Databricks, Amazon Redshift, Google BigQuery, and Microsoft Azure Synapse to enable new levels of efficiency and productivity across any organization.


DATA SCIENCE

KNIME Accelerates Data Science Democratization Through Snowflake Collaboration

KNIME | June 10, 2022

KNIME, the open source data science company, today announced a strategic partnership with Snowflake, the Data Cloud company, to democratize access to data analytics across all roles and departments.

Understanding data is critical for creating business value. With the global data analytics market worth more than $200 billion, it’s necessary for as many people as possible across roles, departments and industries to have access to analytics in their daily jobs for overall better productivity.

“Many of our customers rely on Snowflake to power virtually any data workload at scale, while utilizing KNIME to gain value from that data.”

Paul Treichler, VP of global partnerships at KNIME

Tarik Dwiek, Snowflake’s head of technology partnerships, added, “In partnership with KNIME, we look to enrich the Snowflake ecosystem with tools that can enable an even greater share of enterprises and both technical and non-technical users of data.”

The joint offering means that users can access and manipulate data in Snowflake with a low-/no-code platform at no cost. KNIME Analytics Platform is a fully featured analytics workflow “designer” that can be used in conjunction with Snowflake’s Data Cloud to perform a broad range of analytics tasks from data prep to data science. Users can leverage the drag-and-drop interface to prepare and explore data, rapidly build analytical models, create data apps, and present results in BI tools such as Tableau or Power BI.

KNIME is flexible and extensible, giving data experts the freedom to work in their preferred environment. Users can build sophisticated analytic models in its low-code/no-code environment or script custom algorithms in a language of their choice with built-in integrations with R, Python, Java and more.

KNIME has a vibrant open source community of users who share their knowledge and expertise in specialized forums. Technical and non-technical teams can make use of this community to leverage pre-built components and workflows to accelerate their time to value, and can also upskill themselves through comprehensive free training and learning content available from KNIME. Upskilling non-technical teams to use data science and analytics leaves technical teams with greater bandwidth and freedom to concentrate on more complex tasks.

Across industries, enterprises can also take advantage of KNIME’s commercial offering. KNIME Server offers a suite of features for automation, governance, production deployment and MLOps. Snowflake working in concert with KNIME Server enables organizations to move beyond pilot projects and build enterprise-scale data solutions that are compliant and accessible across the organization. Lastly, KNIME extends the deployment flexibility of Snowflake to the analytics layer, allowing enterprises to utilize the right resources for a given workload or scenario.

“We are excited about the partnership between Snowflake and KNIME,” said Ryan Bosshart, CEO of phData, the Snowflake 2021 RSI Partner of the Year and KNIME Elite Partner. “We've been building with both Snowflake and KNIME because we believe in platforms and technology that make it easier for people to build data products, in both business and technical roles. I’m excited to see what new use cases are possible with this combination.”

About KNIME
KNIME helps individuals and organizations make sense of data. KNIME software bridges the worlds of dashboards and advanced analytics through an intuitive interface, appropriate for anybody working with data. It empowers more business experts to be self-sufficient and more data experts to push the business to the bleeding edge of modern data science, integrating the latest AI and machine learning techniques. KNIME is distinct in its open approach, which ensures easy adoption and future-proof access to new technologies.


BIG DATA MANAGEMENT

Komprise Automates Unstructured Data Discovery with Smart Data Workflows

Komprise | May 20, 2022

Komprise, the leader in analytics-driven unstructured data management and mobility, today announced Komprise Smart Data Workflows, a systematic process to discover relevant file and object data across cloud, edge and on-premises datacenters and feed data in native format to AI and machine learning (ML) tools and data lakes.

Industry analysts predict that at least 80% of the world’s data will be unstructured by 2025. This data is critical for AI and ML-driven applications and insights, yet much of it is locked away in disparate data storage silos. This creates an unstructured data blind spot, resulting in billions of dollars in missed big data opportunities.

Komprise has expanded Deep Analytics Actions to include copy and confine operations based on Deep Analytics queries, added the ability to execute external functions such as running natural language processing functions via API, and expanded global tagging and search to support these workflows. Komprise Smart Data Workflows allow you to define and execute a process with as many of these steps as needed, in any sequence, including external functions at the edge, datacenter or cloud. Komprise Global File Index and Smart Data Workflows together reduce the time it takes to find, enrich and move the right unstructured data by up to 80%.

“Komprise has delivered a rapid way to visualize our petabytes of instrument data and then automate processes such as tiering and deletion for optimal savings,” says Jay Smestad, senior director of information technology at PacBio. “Now, the ability to automate workflows so we can further define this data at a more granular level and then feed it into analytics tools to help meet our scientists’ needs is a game changer.”

Komprise Smart Data Workflows are relevant across many sectors. Here’s an example from the pharmaceutical industry:
1) Search: Define and execute a custom query across on-prem, edge and cloud data silos to find all data for Project X with Komprise Deep Analytics and the Komprise Global File Index.
2) Execute & Enrich: Execute an external function on Project X data to look for a specific DNA sequence for a mutation and tag such data as "Mutation XYZ".
3) Cull & Mobilize: Move only Project X data tagged with "Mutation XYZ" to the cloud using Komprise Deep Analytics Actions for central processing.
4) Manage Data Lifecycle: Move the data to a lower storage tier for cost savings once the analysis is complete.

Other Smart Data Workflow use cases include:
Legal Divestiture: Find and tag all files related to a divestiture project, move sensitive data to an object-locked storage bucket, and move the rest to a writable bucket.
Autonomous Vehicles: Find crash test data related to abrupt stopping of a specific vehicle model and copy this data to the cloud for further analysis. Execute an external function to identify and tag data with Reason = Abrupt Stop and move only the relevant data to the cloud data lakehouse to reduce the time and cost associated with moving and analyzing unrelated data.

“Whether it’s massive volumes of genomics data, surveillance data, IoT, GDPR or user shares across the enterprise, Komprise Smart Data Workflows orchestrate the information lifecycle of this data in the cloud to efficiently find, enrich and move the data you need for analytics projects. We are excited to move to this next phase of our product journey, making it much easier to manage and mobilize massive volumes of unstructured data for cost reduction, compliance and business value.”

Kumar Goswami, CEO of Komprise

About Komprise
Komprise is a provider of unstructured data management and mobility software that frees enterprises to easily analyze, mobilize, and monetize the right file and object data across clouds without shackling data to any vendor. With Komprise Intelligent Data Management, you can cut 70% of enterprise storage, backup and cloud costs while making data easily available to cloud-based data lakes and analytics tools.
