Enabling Exploratory Analysis of Large Data with Apache Spark and R

R has evolved to become an ideal environment for exploratory data analysis. The language is highly flexible: there is an R package for almost any algorithm, and the environment comes with integrated help and visualization. SparkR adds distributed computing and the ability to handle very large data to this list. SparkR is an R package distributed with Apache Spark. It exposes Spark DataFrames, which were inspired by R data.frames, to R. With Spark DataFrames and Spark’s in-memory computing engine, R users can interactively analyze and explore terabyte-size data sets.

OTHER ON-DEMAND WEBINARS

Webinar Recording: RDCA-DAP Collaboration with Clinerion for Real World Data Solutions

The Rare Disease Cures Accelerator–Data and Analytics Platform (RDCA-DAP®) is launching a new webinar series to share with the community examples of how rare disease person-level data is used in drug development and regulatory decision making. The series will feature 1-hour webinars highlighting analyses that have been done in individual disease areas, how they informed drug development, and how similar approaches could be applied to common drug development issues encountered in rare diseases. After each presentation, time will be allowed for discussion with a panel of quantitative and regulatory experts on why the solutions presented were informative, what lessons were learned, and how similar approaches could be applied to related problems.

Data visualization in Looker Studio: Better dashboards with Supermetrics Charts

It’s not always about the numbers; it’s more about how you present them in an easily understandable way. Not everything can go into a pie chart, and not everything looks right in a line graph. That’s why it’s crucial to know the core principles of data visualization.

SAP HANA: Future of Data Warehousing and Big Data Analytics

SAP HANA

This webinar will discuss: the history and present of the SAP business model; an overview of the technology underlying SAP HANA (in-memory computing), big data, and the space for analytics; the SAP HANA modelling environment (architecture and setting up the environment); data warehousing models (star schema and the concept of master data); basic graphical views (attribute, analytical, and calculation views); and database operations in the HANA tool.

Data Storytelling With Multiexperiences

Gartner

Discussion Topics:
- What is a data story
- When and how should data storytelling be used
- Which new skills and techniques do you need to create compelling data stories
- Which experiences can help tell your story beyond dashboards on a 2D screen