Making Big Data Processing Simple with Spark," Matei Zaharia

As data volumes grow, we need programming tools for parallel applications that are as easy to use and versatile as those for single machines. The Spark project started at UC Berkeley to meet these goals. Spark is based on two main ideas. First, it has a language-integrated API in Python, Java, Scala and R, based on functional programming, that makes it easy to build applications out of functions to run on a cluster. Second, it offers a general engine that can support streaming, batch, and interactive computations, as well as advanced analytics such as machine learning, and lets users combine them in one program. Since its release in 2010, Spark has become a highly active open source project, with over 900 contributors and a broad set of built-in libraries. This talk will cover the main ideas behind the Spark programming model, and recent additions to the project…

Spotlight

Nuvento

A consulting firm that specializes in Information Management, Business Intelligence, Data Management, Reporting, Analytics, Software design and development. Nuvento is headquartered in Lenexa, Kansas. Core to our value proposition is our delivery model based on Centers of Excellence (CoE) that focus on Business Intelligence with specialization in OBIEE, Microsoft BI, Insurance BI, Telecom BI, Social Media Analytics, SharePoint and .NET. Our clients benefit from our pre-packaged product suite, BI Report Center (Web based OLAP Reporting platform using SSAS), ETL Center (Web based ETL Management Framework for SSIS), Implementation frameworks like Task Bolt (Open Source Portfolio Management Methodology), Software Architecture & Refactoring Methodology as well as expert SharePoint implementation consulting and development. Proven industry subject matter expertise within the fields of engineering, insurance, finance, telecom, and logistics. Our expert team includes specialists in Architectur

OTHER VIDEOS

Understanding User Entity and Behavior Analytics (UEBA)

video | August 11, 2023

User Entity and Behavior Analytics (UEBA) is a cybersecurity technology and approach that focuses on analyzing the behavior of users and entities (such as devices, applications, and systems) within an organization's IT environment. By using advanced data analytics, machine learning algorithms, and artificial intelligence, UEBA aims to detect and prevent cyber threats by identifying anomalies, deviations, or patterns in user and entity activities that might indicate potential security risks....

Watch Now

How to Use Analytics in UX

video | August 14, 2023

Integrating analytics into UX work helps to make data-based decisions and focus efforts on projects with the most significant impact....

Watch Now

Forestat Global: Data and Analytics Spanning the Entire Forest Value Chain

video | July 28, 2023

The only business intelligence platform with data and analytics spanning the entire forest products value chain—from wood to biofuel. With a wealth of information and global coverage, Forestat Global helps decision-makers navigate the forest products-based market with confidence....

Watch Now

Modak Nabu: An Integrated Data Engineering Platform

video | July 28, 2023

Are you looking for a solution to break down data silos, democratize access, and unleash the true power of data within your enterprise? Check out how Modak has brought automation of the data life cycle within an organization leading toward innovation. Modak is a leading provider of data engineering solutions, empowering organizations to harness the power of their data with a focus on simplicity, automation, and scalability. Modak offers innovative products and services that streamline data processes and drive actionable insights....

Watch Now

Spotlight

Nuvento

A consulting firm that specializes in Information Management, Business Intelligence, Data Management, Reporting, Analytics, Software design and development. Nuvento is headquartered in Lenexa, Kansas. Core to our value proposition is our delivery model based on Centers of Excellence (CoE) that focus on Business Intelligence with specialization in OBIEE, Microsoft BI, Insurance BI, Telecom BI, Social Media Analytics, SharePoint and .NET. Our clients benefit from our pre-packaged product suite, BI Report Center (Web based OLAP Reporting platform using SSAS), ETL Center (Web based ETL Management Framework for SSIS), Implementation frameworks like Task Bolt (Open Source Portfolio Management Methodology), Software Architecture & Refactoring Methodology as well as expert SharePoint implementation consulting and development. Proven industry subject matter expertise within the fields of engineering, insurance, finance, telecom, and logistics. Our expert team includes specialists in Architectur

Events