DATA SCIENCE

Cazena Announces the Launch of Instant AWS Data Lake to Accelerate Analytics Migration to AWS

Cazena | November 02, 2020

Cazena today announced the launch of the Instant AWS Data Lake. Production-ready in just minutes, the Instant AWS Data Lake is the fastest and most cost-efficient solution for enterprises new to AWS or struggling through months-long DIY data lake projects. The Instant AWS Data Lake has been developed in partnership with AWS; Cazena is an AWS Partner Network (APN) Advanced Technology Partner.

With the Instant AWS Data Lake, Cazena now provides a first-in-industry "easy button" for AWS analytics and for moving enterprises' AI/ML initiatives forward. The Instant AWS Data Lake is ready for analytics in minutes, without requiring operational skills or resources. All enterprises looking to modernize around cloud data lakes can now safely and quickly migrate to AWS, and benefit from the rich and constantly growing analytics stack that AWS offers. Cazena's Instant AWS Data Lake orchestrates and integrates numerous AWS analytics services – from ingestion to analytics – into a unified, easy-to-operate, and production-ready SaaS experience. This experience includes seamless integrations with AWS services including EMR, Athena, Glue, MSK, S3, SageMaker, and more.

“Getting cloud data lakes off the ground continues to be a major source of frustration for enterprises,” said Prat Moghe, CEO, Cazena. “Production deployments often require a minimum of six months to get off the ground, and millions of dollars are spent annually on operations teams to build and manage them – if a business can recruit, hire, and retain this particularly scarce talent. Cazena’s Instant Cloud Data Lakes deliver a secure, hybrid, production-ready experience – with instant time-to-analytics – without requiring additional skills or resources. And we are delivering this SaaS solution at half the cost of DIY data lakes.”

“Cloud data lakes are increasingly the focus for teams building data engineering and data science. Modern cloud data lakes go far beyond storage and need to deliver capabilities for data ingestion, analytics, and AI/ML,” said Daniel Parton, Lead Data Scientist at Bardess, a business analytics and data strategy company using Cazena for cloud data lakes. “Cazena’s turnkey cloud data lake solution significantly reduces the time it takes to stand up a production data lake while addressing the complexity of managing the environment. Cazena’s ability to provide the Instant AWS Data Lake as a SaaS experience is particularly noteworthy for any enterprise embracing data science, machine learning and digital transformation.”

What is a cloud data lake?

Modern cloud data lakes are more than storage or cataloging – they represent the complete production analytical environment, from ingestion to storage to processing and tools. Cloud data lakes provide a flexible and unified analytical platform for enterprises that need to modernize their data environments and migrate analytical workloads to the cloud. Cloud data lakes are ideal environments for AI/ML, data engineering and other analytics since they support “beyond SQL” processing with multiple databases like SQL, Spark, Search, etc. Cloud data lakes complement cloud data warehouses which support SQL-only processing for BI.

The challenges of cloud data lakes

Enterprises continue to face several significant obstacles when deploying and managing cloud data lakes. A lack of skills remains among the biggest hurdles, and this often translates to months-long efforts to deploy production data lakes. Most of this effort is spent in bespoke DevOps around orchestration, identity management, security, compliance, and ongoing monitoring and operations of the end-to-end data lake environment.

Key benefits and capabilities of Cazena’s Instant AWS Data Lake:

Analytics that are ready in minutes. The Instant AWS Data Lake is an automated turnkey analytical environment, from ingestion to tools. The SaaS solution includes connectivity to on-premises data sources and users, security controls, and other mission-critical cloud resources. All AWS analytics services including EMR, Athena, SageMaker, MSK, Glue, and others are orchestrated, provisioned, and configured with unified identity management so that enterprise users can on-board immediately. The cloud data lake can be deployed either as a standalone account or attached to an enterprise’s existing AWS account.

Continuous ops that optimize costs and SLAs. The Instant AWS Data Lake is continuously monitored and optimized for workload performance, cost, and availability. Existing data teams can now use an AWS data lake without requiring dedicated DevOps or CloudOps resources. Cazena’s Instant AWS Data Lake solution is less than half the cost of typical do-it-yourself AWS data lakes – and without the headaches.

Built-in security and compliance as a private SaaS on AWS. Enterprises get their own Instant AWS Data Lake delivered as a private, fully secured cloud service that is encrypted and continuously monitored for security and compliance. Built-in controls are default for SOC-2, GDPR, HIPAA, CCPA, and other industry regulations.

Self-service analytics with one-click access to all tools. The Instant AWS Data Lake offers a comprehensive console for AWS analytical tools like SageMaker and QuickSight, as well as third-party tools. BI analysts, data engineers, data scientists, and analytics-dependent users get secure one-click access to a complete cloud data lake with their favorite tools, whether in the cloud or from their on-premises environment.

About Cazena

Cazena makes cloud data lakes easy for all enterprises. Cazena’s Instant Cloud Data Lake accelerates time-to-analytics and AI/ML from months to minutes. Powered by its patented and fully-automated Open SaaS Data platform, Cazena delivers the first SaaS experience for data lakes – zero operations required. Founded by Netezza leaders, Cazena is revolutionizing cloud data lakes.

Spotlight

http://ibm.co/16MDdZR Big Data & Analytics is enabling companies to deliver the right message, to the right person, at the right time, for the right price. Leading marketers are using this advantage to deliver greater value and relevance to their customers. Learn how in this video from IBM.


Other News
BUSINESS INTELLIGENCE, BIG DATA MANAGEMENT

EY announces alliance with Alteryx to help accelerate digital transformation through analytics automation

EY | November 28, 2022

The EY organization today announces an alliance between Alteryx, one of the leaders in analytics automation, and Ernst & Young LLP (EY US), to help organizations unlock the power of data through automation and digital transformation.

Most organizations face inefficiencies and increased costs when carrying out day-to-day business and back-office operations. As they undergo digital transformation efforts, they tend to devote more time to data manipulation than to data analysis. As a result, revisiting and updating their existing technologies to improve data literacy throughout the organization becomes critical to achieve their transformation goals.

The EY–Alteryx Alliance will help clients across various sectors optimize data-driven processes by generating valuable insights to deliver faster, better business outcomes to achieve efficiency in business operations. The alliance leverages the highly intuitive and easy-to-learn data analytics automation platform of Alteryx along with the EY organization's digital transformation capabilities across Strategy and Transactions, Consulting and Tax.

The Alteryx platform combines three key pillars of automation and digital transformation – data, processes and people – to help enable data democratization, business process automation and people upskilling. Users are then better able to unlock the value of advanced analytics using its user-friendly platform, analyze a wide range of data from multiple sources and deliver business insights to answer business questions more efficiently.

Among other strengths, EY US is well-known among clients and in the market for its consulting capabilities. With more than 700 certified implementers of Alteryx across service lines and countries, EY US teams have built innovative, proprietary solutions that are supported by Alteryx. Through the EY–Alteryx Alliance, clients gain access to and counsel from the right technology and consulting talent for data exploration, transformation and analysis.

Brian May, EY Americas Alliance and Managed Services Leader, says: "This collaboration combines advanced technology and consulting capabilities for data exploration and analysis across key functional areas including tax, finance, human resources, supply chain, internal audit and IT. Activating and accelerating rapid digital transformation is paramount in helping organizations efficiently navigate today's evolving business landscape."

Barb Huelskamp, Alteryx SVP of Channel Sales, says: "By aligning the EY organization's rich heritage of experience with the Alteryx analytics automation platform, we provide incremental value for key customer segments across the office of finance, human resources, supply chain and more. Our shared objective helps organizations optimize analytics to help drive large-scale business transformations."

About EY

EY exists to build a better working world, helping to create long-term value for clients, people and society and build trust in the capital markets. Enabled by data and technology, diverse EY teams in over 150 countries provide trust through assurance and help clients grow, transform and operate.


BUSINESS INTELLIGENCE, BIG DATA MANAGEMENT, DATA SCIENCE

OceanBase Cloud now available in AWS Marketplace, expanding its distributed database services to customers globally

OceanBase | October 14, 2022

OceanBase, a distributed relational database solution provider, today announced that its cloud service OceanBase Cloud is now available in Amazon Web Services (AWS) Marketplace, a digital catalog with thousands of software listings from independent software vendors that make it easy to find, test, buy and deploy software that runs on AWS.

The launch allows AWS customers globally to quickly and easily access OceanBase’s database services, which are designed to deliver ultra-fast performance, elastic scalability and cost-effectiveness for transactional and operational analytics workloads.

OceanBase is the only distributed database that has refreshed both TPC-C and TPC-H records. Its innovative city-level disaster recovery standard of "Five IDCs across Three Cities" enables zero Recovery Point Objective (RPO) and less than 30 seconds of Recovery Time Objective (RTO), and the service’s high compression technology saves 70-90% storage cost without compromising performance. Additionally, OceanBase’s multi-tenancy flexibility ensures better resource utilization, which helps reduce costs.

The listing in AWS Marketplace provides more customers worldwide with one-stop access to OceanBase Cloud services, including database monitoring, diagnostics, development, migration, backup, and restoration.

“We are delighted to collaborate with AWS Marketplace to bring OceanBase to more customers globally, at a time when enterprises are increasingly looking for consistent, scalable, resilient, and cost-effective database solutions amid their growing needs for efficient data analytics capabilities. OceanBase, which is cloud-neutral, will continue to work with cloud vendors to broaden access to our time-tested capabilities in managing data-intensive transactions and multiple real-time analytical workloads,” said Yin Boxue, General Manager of Public Cloud Services Division, OceanBase.

OceanBase has a ten-plus year track record of successfully managing large-scale and complex database needs, including supporting Alipay in processing peak transaction volumes for Alibaba Group’s annual 11.11 Global Shopping Festival – one of the world’s largest online-shopping events. To date, OceanBase has served over 400 customers worldwide, including some of the largest financial institutions in China, such as the Industrial and Commercial Bank of China, which have selected OceanBase as their preferred technology provider to upgrade their core IT systems.

In Southeast Asia, OceanBase provides database solutions to GCash, one of the leading mobile payment providers and the largest e-wallet in the Philippines, and Dana, one of the leading digital wallet providers in Indonesia. OceanBase has helped reduce GCash’s database resource cost by more than 40% on average after adoption.

OceanBase 4.0, which supports both vertical and horizontal scalability capabilities, was released in August 2022. With its distributed monolithic architecture, OceanBase 4.0 can be deployed on any stand-alone machine while performing the full functions as a distributed deployment. This lowers the technology and cost barriers for various types of customers, especially for small and medium-sized enterprises (SMEs) to adapt to using enterprise databases.

About OceanBase

Launched in 2010, OceanBase is a distributed relational database. OceanBase’s strengths over alternative solutions include strong data consistency, high availability, high performance, cost effectiveness, elastic scalability, and high compatibility with mainstream relational databases. It enables transactions and analytical queries with just one set of engines, empowering real-time business intelligence. In May 2020, OceanBase set the world record for online transaction processing performance, with 707 million transactions per minute in a TPC-C benchmark test. OceanBase was acknowledged as a notable vendor by Forrester in its report “The Translytical Data Platforms Landscape, Q3 2022,” published in July 2022.


BUSINESS INTELLIGENCE, BIG DATA MANAGEMENT

Cribl and Cloudian Offer S3 Data Lake-based Observability Platform for Modern Data Analytics

Cribl | October 12, 2022

Cribl and Cloudian® today announced a modern data analytics solution that integrates Cribl Stream and Cloudian HyperStore® object storage to provide an observability platform on an on-premises, S3-compatible data lake. The joint solution enables organizations to ingest, parse, restructure and enrich large data volumes in flight, ensuring they get the right data in the formats they need – all securely behind their firewall. Users can convert logs into metrics, reduce cost, and increase search speed.

The Need for an Observability Platform

Monitoring solutions alone only help enterprises answer well-known questions about their infrastructure and environment. To get new signals or opportunities, they also need an observability platform that can ingest and store data from multiple sources with full fidelity and can replay the relevant data into various analytics tools without adding new infrastructure and agents. This allows IT teams to perform advanced analytics, getting full visibility and insight into their environment, from applications to infrastructure assets.

The Cribl-Cloudian Solution

Cribl Stream is an observability pipeline that collects data from any source and can send and replay data to a Cloudian HyperStore S3-compatible data lake. The data is stored in HyperStore with full fidelity and is always available to search and analyze. HyperStore can scale up to thousands of nodes across multiple data centers, supporting millions of users and hundreds of petabytes of data. Other solution benefits include:

Cost-effective, limitless scalability for long-term compliance – Data can be reformatted via a pre-parser, helping users mask, reduce, or restructure and route it to multiple destinations, such as low-cost Cloudian storage (½¢/GB/month or less) for long-term retention.

Replay data – Customers can replay multiple data formats stored in the data lake to popular analytics and search platforms.

Hybrid cloud ready – Users can employ policy-based tools to replicate or tier data to AWS, GCP, Azure, or another Cloudian HyperStore cluster for offsite disaster recovery, capacity expansion or data analysis in the cloud, enabling cost-effective storage across environments but managed as a single pool.

Military-grade security and ransomware protection – Features include AES-256 server-side encryption for data stored at rest, SSL for data in transit (HTTPS), role-based access controls with specified levels of access, audit trail logging and data immutability for protection against ransomware.

Data resiliency – The solution provides up to 14 nines of resiliency and supports administrator-selectable storage policies based on replication or erasure coding, along with fine grain control of data placement across data centers.

“Giving our customers freedom and control over their data is the core of Cribl. When customers use Cloudian and Cribl together, they can collect the largest amounts of data in any format with Cribl Stream and store and replay full fidelity data leveraging Cloudian HyperStore, saving costs and optimizing performance,” said Zac Kilpatrick, vice president, Global Channels & Alliances, Cribl.

“Organizations are continually striving to gain deeper insights from their data to drive greater business and operational advancements,” said Larry Meese, vice president, Products and Solutions, Cloudian. “The Cribl-Cloudian solution enables them to easily and cost-effectively apply modern analytics applications to their on-prem data with the same scalability and flexibility of public cloud services while avoiding the cost and performance issues of moving large data volumes to the cloud.”

About Cribl

Cribl makes open observability a reality for today’s tech professionals. The Cribl product suite defies data gravity with radical levels of choice and control. Wherever the data comes from, wherever it needs to go, Cribl delivers the freedom and flexibility to make choices, not compromises. It’s enterprise software that doesn’t suck, enables tech professionals to do what they need to do, and gives them the ability to say “Yes.” With Cribl, companies have the power to control their data, get more out of existing investments, and shape the observability future. Founded in 2017, Cribl is a remote-first company with an office in San Francisco, CA.

About Cloudian

Cloudian is the leader in data management software for the hybrid cloud. With military-grade security, limitless scalability and seamless cloud integration, Cloudian’s S3-compatible object storage lets users optimize data access, meet data sovereignty requirements and cut costs by consolidating information to a single, cloud-like platform. Cloudian’s geo-distributed architecture manages and protects object and file data at the edge, core, and in the cloud, for both conventional and modern applications.


BUSINESS INTELLIGENCE, BIG DATA MANAGEMENT, DATA ARCHITECTURE

Mode Analytics Recognized as a Leader in Snowflake’s Modern Marketing Data Stack Report

Mode Analytics | September 30, 2022

Mode Analytics today announced that it has been recognized as a Business Intelligence Leader in the inaugural Modern Marketing Data Stack Report: Your Technology Guide to Unifying, Analyzing, and Activating the Data that Powers Amazing Customer Experiences, executed and launched by Snowflake, the Data Cloud company.

Snowflake’s data-backed report identifies the best of breed solutions used by Snowflake customers to show how marketers can leverage the Snowflake Data Cloud with accompanying partner solutions to best identify, serve, and convert valuable prospects into loyal customers. By analyzing usage patterns from a pool of nearly 6,000 customers, Snowflake identified six technology categories that organizations consider when building their marketing data stacks. These categories include:

Analytics
Integration & Modeling
Identity & Enrichment
Activation & Measurement
Business Intelligence
Data Science & Machine Learning

Focusing on companies that are active members of the Snowflake Partner Network (or ones with a comparable agreement in place with Snowflake), as well as Snowflake Marketplace Providers, the report explores each of these categories that comprise the Modern Marketing Data Stack, highlighting technology partners and their solutions as “leaders” or “ones to watch” within each category. The report also details how current Snowflake customers leverage a number of these partner technologies to enable data-driven marketing strategies and informed business decisions. Snowflake’s report provides a concrete overview of the partner solution providers and data providers marketers choose to create their data stacks.

“Marketing professionals continue to expand their investment in analytics to improve their organization’s digital marketing activities. Mode has emerged as a leader in the Modern Marketing Data Stack, with joint customers leveraging their technology to interpret insights that lead to informed business decisions,” said Denise Persson, Chief Marketing Officer at Snowflake.

Mode was identified in Snowflake’s report as a Leader in the Business Intelligence category for its particular success with Visual Explorer, Mode’s flexible visualization system that helps analysts explore data faster and provides easy-to-interpret insights to business stakeholders. Additionally, Mode and Snowflake have partnered in the past couple of years to create a modern data analytics stack, mobilizing the world’s data with the Snowflake Data Cloud to help joint customers quickly execute queries and perform analysis.

“Mode combines the best elements of business analytics and data science into a single platform, unlocking new ways for marketers to accelerate data-driven outcomes,” said Gaurav Rewari, CEO, Mode Analytics. “Our partnership with Snowflake makes it possible for marketing and other departments across an organization to truly centralize and interact directly with their data. With Snowflake’s single, integrated data platform, built to fully leverage the speed and flexibility of the cloud, organizations can mobilize their data in near-real time.”

About Mode Analytics

Mode’s advanced analytics platform is designed by data experts for data experts. It allows data scientists and analysts to visualize, analyze, and share data using a powerful end-to-end workflow that covers everything from early data exploration stages to presentation-ready shareable products. Unlike traditional business intelligence tools that produce static dashboards and reports, Mode brings the best of BI and data science together in a single platform, empowering everyone at your organization to use data to make high quality, high velocity decisions. Mode also supports the analytics community with free learning resources such as SQL School, open source SQL queries, and free tools for anyone analyzing public data.

