Pre-Processing Big Data: Techniques to improve quality of big data analysis

Data in the real-world is almost always dirty, incomplete, scattered or inconsistent. For data scientists, 'janitor work' is a key hurdle to data insights. Whether you use big data for analytics or data science, with increasing variety and velocity of big data, the data pre-processing step can be the most time-consuming step in your data pipeline. Featuring engineering concepts and practical examples in Python and R, this webinar will focus on technical considerations and data engineering techniques to optimise data preparation to get the most value from your big data pipeline.
Watch Now

Spotlight

OTHER ON-DEMAND WEBINARS

Moving from centralized data platform to a federated data mesh

In this webinar, we will cover the pros and cons of building a centralized data lake vs federated data mesh. Traditionally data warehouses are built on the premise of centralized data. This requires team, process and tool alignment which adds significant complexity and layers of process. Oftentimes the internal conflicts lead to subpar data management and quickly fragments to siloed data processing and insights. We will provide our unbiased view on building federated data mesh and the benefits of building an operational metrics layer.
Watch Now

Data Privacy During Economic Downturn: How to Make It Work With Limited Resources?

The global spread of the novel coronavirus (COVID-19) and the economic impact that followed has prompted many businesses to furlough the workforce or migrate from the traditional office to remote-working environments. The volatile landscape has created incremental risks, especially for organizations heavily relying on IT Sec/Ops teams to monitor security and privacy and enforce regulatory compliance.
Watch Now

Trends for Modernizing Analytics and Data Warehousing in 2019

Arcadiadata

Brand new research published from Dresner Advisory Services digs deeply into the trends in 2018 around big data analytics. Where are organizations heading in 2019? How are analytic and data warehouse architectures evolving to enable faster and deeper self-service analytics and BI for organizations looking to create a competitive edge? How is public, private and hybrid clouds factoring into deployment decisions? What are the hottest open source projects from Apache Spark to Kudu, Kafka, Hadoop, and beyond?
Watch Now

How Enterprises Are Leveraging Data & Analytics to Deliver 2X More Value from Their Shared Services Centers

everestgrp

As technology adoption increases exponentially, organizations are challenged by the proliferation of data that the technology generates. Increasingly, Shared Services Global In-house Centers (GICs) are leading their organizations’ efforts to tame data and derive key insights from it. Based on our recent Pinnacle Model research on data & analytics maturity in SSCs/GICs, this webinar will show executives how they can build capabilities in their SSCs GICs to turn this challenge into a strategic asset, generating value and enhancing service delivery
Watch Now