Building Big Data Applications: Getting started with Apache Spark

October 27, 2015

These days business organizations are churning out huge amounts of transactional data, capturing trillions of bytes of infor-mation about their customers, suppliers, and operations. While this may have once concerned only a few data geeks, big data is now relevant across business sectors; consumers of products and services. Everyone stands to benefit from its application. The combination of massive data sets and the rapid development of new technologies capable of storing and processing infor-mation out of them have already transformed the way businesses operate. Over the last decades, internet giants like Amazon, Google, Yahoo!, eBay and Twitter have invented and used tools for working with colossal data sets that was beyond the realm of traditional data management tools.

Spotlight

Integra Technology Consulting

Integra Technology Consulting is a consulting and systems integration firm headquartered in Waltham, MA. Integra has deep expertise in databases. This includes traditional RDBMS and newer NoSQL and MPP technologies, which allows us to help companies select and implement the optimal technology for managing their data. Integra originated in 1986 as the first consulting company on the East coast to specialize in Oracle and related technologies. In the age of "Big Data", where RDBMS technology is ill-equipped to handle the ever-increasing volume, variety and velocity of data, Integra has gained expertise in Hadoop, Spark, AWS Redshift, Cassandra, and other NoSQL / MPP technologies so that we may continue to help our clients with their most challenging Information Technology initiatives.

OTHER WHITEPAPERS
news image

Build a thriving business using data and analytics at scale

whitePaper | February 7, 2020

The problem is that growing volumes of data can lead to data paralysis, because no one knows quite where to begin or how to use it. Siloed information sources, no data management strategy, disconnected spreadsheets or analytical tools and poor data quality all work to compound the issues you’re facing.

Read More
news image

Building a data fabric for analytics with Tableau

whitePaper | September 6, 2022

Data is the heartbeat of the modern enterprise. Technology has progressed to the point that data-driven decision making is the norm and data literacy is often prized above all other skills.

Read More
news image

Protect personally identifiable information with data privacy measures

whitePaper | September 23, 2022

The COVID-19 pandemic times will go down in the annals of history as the ones which saw organizations face significant disruption in the way they executed their frontend business and backend operations. The pandemic waves have made organizations do the unthinkable – allowing their workforce to either operate exclusively from their homes, or at a minimum, distribute their working hours between home and office.

Read More
news image

Vida for Retail: Rebooting Retail withthe Power of Data Analytics

whitePaper | January 11, 2023

There used to be a time when purchase decisions were based on recommendations of the neighborhood grocery manager. Today, these purchase decisions are driven by technology, bringing together the collective wisdom of consumers of the same product from across the world.

Read More
news image

Architecting for HIPAA Security and Compliance on Amazon Web Services

whitePaper | January 27, 2020

AWS maintains a standards-based risk management program to ensure that the HIPAA-eligible services specifically support the administrative, technical, and physical safeguards required under HIPAA. Using these services to store, process, and transmit PHI allows our customers and AWS to address the HIPAA requirements applicable to the AWS utility-based operating model.

Read More
news image

The Path to Self-Service Analytics on the Data Lake: Asset

whitePaper | May 2, 2022

Download this white paper to get a step-by-step roadmap for adopting Dremio and migrating workloads while maintaining coexistence and interoperability with existing systems and technologies.

Read More

Spotlight

Integra Technology Consulting

Integra Technology Consulting is a consulting and systems integration firm headquartered in Waltham, MA. Integra has deep expertise in databases. This includes traditional RDBMS and newer NoSQL and MPP technologies, which allows us to help companies select and implement the optimal technology for managing their data. Integra originated in 1986 as the first consulting company on the East coast to specialize in Oracle and related technologies. In the age of "Big Data", where RDBMS technology is ill-equipped to handle the ever-increasing volume, variety and velocity of data, Integra has gained expertise in Hadoop, Spark, AWS Redshift, Cassandra, and other NoSQL / MPP technologies so that we may continue to help our clients with their most challenging Information Technology initiatives.

Events