BIG DATA MANAGEMENT

MayStreet Launches Next Generation of Market Data Analytics Product

MayStreet | March 02, 2022

MayStreet, the industry’s leading market data technology and content provider, today announced the launch of the next generation of Analytics Workbench, the firm’s ready-to-use, cloud-based market data analytics environment. Through Analytics Workbench, data analysts can quickly and efficiently query the MayStreet Market Data Lake to drive mission-critical trading workflows without having to manage data capture, delivery or storage.

Key features of the new Analytics Workbench include the ability to:

  • Query and extract data using Python or R for analysis within Workbench or in any other location, whether in the cloud or on-premises
  • Leverage pre-configured Jupyter® notebooks to support out-of-the-box query capabilities
  • Perform ad-hoc analyses or schedule batch jobs to support ongoing reporting requirements
  • Instantly parallelize and scale ad-hoc or scheduled code across the cloud with integrated support for Dask clusters
  • Upload internal order data or other third-party data to leverage in conjunction with MayStreet market data to support TCA, fill analysis and best execution reporting
  • Receive query results in normalized or raw PCAP formats and create reports using powerful visualization tools
  • Choose flexible deployment options, either fully managed within MayStreet’s cloud environment or integrated within a client’s cloud
  • Achieve performance objectives with optimized cluster parallelization

“The completely revamped Analytics Workbench realizes our goal of letting users bring their queries to our data, freeing them from the difficult and costly work of managing the data themselves. For the first time, our vast repository of ultra-high-quality global market data is accessible in a ready-to-use environment that leverages cloud economics. It’s also highly customizable, letting clients choose the level of performance they desire so that costs can be managed based on their needs. For capital markets data analysts, the new Analytics Workbench is a true gamechanger.”

Naftali Cohen, MayStreet’s Chief Revenue Officer

Dave Thompson, Senior Vice President, Frontend Engineering, added: “In the process of redeveloping Analytics Workbench from the ground up, we identified several tools such as Dremio, Dask and Jupyter that could elevate its performance, functionality and scalability. By integrating these and other technologies, we have been able to create a truly modern data access and analytics tool built for the cloud. We’ve had many conversations with clients over the past 18 months about their hopes for a product like this, and we’re very pleased with the end result.”

The new version of Analytics Workbench is currently being used by multiple clients, including a global investment bank, an exchange and a quantitative hedge fund. MayStreet expects additional clients to begin using the product over the coming weeks. Analytics Workbench was used by market structure researchers Robert Bartlett, Justin McCrary and Maureen O’Hara for their recent paper on the impact of odd lot quotes.

“MayStreet’s comprehensive, high-quality exchange data allowed us to document the vital importance of odd lot quotes in today’s equity markets, especially for higher-priced stocks,” said Bartlett, Faculty Director at the Berkeley Center for Law, Business and the Economy. “Without reliable access to the historical data feeds for all exchanges, such a study would simply not be possible given that these quotes are excluded from the SIP data. Additionally, leveraging MayStreet’s Analytics Workbench gave us the computational capacity we needed to process quickly the vast quantities of data.”

MayStreet’s release of Analytics Workbench is the latest in a series of enhancements to the MayStreet Market Data Lake. Other recent enhancements include access through a new High Performance Query (HPQ) API, the introduction of full-depth-of-book data for all US listed options markets and the rounding out of its global coverage with the addition of all major equities and futures markets in Asia-Pacific.

About MayStreet
MayStreet delivers the highest-quality, most complete global market data available. The firm’s solutions – which include the highly accessible Market Data Lake feed repository and Bellport Enterprise feed handler – help market participants generate maximum value from exchange data by delivering it when, where and how they want to receive it. With MayStreet, clients are freed from the difficult and costly work of sourcing and processing market data, leading to lower total cost of ownership, improved decision-making and better performance.

Spotlight

Big Data Analytics and Deep Learning are two high-focus areas of data science. Big Data has become important as many organizations, both public and private, have been collecting massive amounts of domain-specific information, which can contain useful information about problems such as national intelligence, cyber security, fraud detection, marketing, and medical informatics. Companies such as Google and Microsoft are analyzing large volumes of data for business analysis and decisions, impacting existing and future technology. Deep Learning algorithms extract high-level, complex abstractions as data representations through a hierarchical learning process. Complex abstractions are learnt at a given level based on relatively simpler abstractions formulated in the preceding level in the hierarchy.

Related News

BIG DATA MANAGEMENT

Arcadia Research Data Now Available in AWS Data Exchange

Arcadia | July 13, 2022

Arcadia, a leading data analytics platform for healthcare and life sciences, today announced the availability of Arcadia Research Data through AWS Data Exchange, an Amazon Web Services (AWS) industry data solution that makes it easy for customers to find, subscribe to, and use third-party data from a wide range of providers. It offers data delivery through files, tables, and application programming interfaces (APIs) from 250+ third-party data set providers, all in one place. With a vast catalog of 3,000+ data products and straightforward subscription options using AWS account credentials, AWS Data Exchange makes it easy for customers to ingest third-party data and analyze it with a wide variety of AWS data and analytics and machine learning services. Through AWS Data Exchange, research organizations can easily submit a request to review and evaluate a 10,000-patient sample data set from Arcadia. Arcadia Research Data contains de-identified clinical data for nearly 50 million patients across the United States. This data is collected from integrated delivery networks and accountable care organization clients using Arcadia's analytical platform. Arcadia Research Data is built on an active clinical and claims-based patient population set that features comprehensive visibility across payers, multiple sites of care, and the entire clinical patient journey. Using AWS Data Exchange, research organizations can easily discover and evaluate Arcadia Research Data to unlock insights to support whole-person care. By removing the friction of finding, procuring, and using clinical data across global sources, AWS Data Exchange enables customers to quantify health outcomes, accelerate research and clinical trial design, and understand patient sentiment and social determinants of health.
From a single cloud catalog, research organizations can easily find, subscribe to, and use thousands of diverse real-world data sets and healthcare APIs to generate evidence, identify trends, and accelerate research.

"We are excited to release our data set more broadly in AWS Data Exchange to unlock transformative insights in life sciences research. More importantly, Arcadia's work with AWS will help research organizations solve data-fulfillment challenges that have often hindered researchers' ability to quantify health outcomes, accelerate research, and improve patient care."

Jim Robbins, SVP of Life Sciences at Arcadia

About Arcadia
Arcadia is dedicated to happier, healthier days for all. We transform data into powerful insights that deliver results. Through our partnerships with the nation's leading health systems, payers, and life science companies, we are growing a community of innovation to improve care, maximize value, and confront emerging challenges.


BUSINESS INTELLIGENCE, BIG DATA MANAGEMENT, DATA SCIENCE

Teradata Announces VantageCloud Lake for Driving Analytical Innovation at Scale

Teradata | August 30, 2022

Teradata today announced VantageCloud Lake, Teradata’s first product built on an all-new, next-generation cloud-native architecture. Based on the deep history and expertise of Teradata, VantageCloud Lake brings the proven power of Teradata Vantage, now called VantageCloud Enterprise, to a new offering that is born in the cloud – and designed to be automatically elastic and leverage low-cost object store at its core, but still powerful, easy to use and scale (or stop). This enables an expanded set of customers and workloads beyond what Teradata is traditionally known for: rather than focusing mainly on information technology (IT)-managed enterprise workloads, VantageCloud Lake is a self-service offering designed to bring the unparalleled capabilities of Vantage to broader and more diverse use cases. By allowing businesses to leverage the industry’s best advanced analytics capabilities and scale smarter, but with lower total cost of ownership, Teradata VantageCloud Lake is designed to rapidly accelerate business outcomes for virtually any use case, including smaller ad hoc, exploratory, and departmental workloads. Customers can choose either edition – VantageCloud Lake or VantageCloud Enterprise – depending on their business needs because both provide the best of Teradata, including the company’s recognized workload management, incredible scale, financial governance, and data fabric. Together they form Teradata’s new cloud offering, VantageCloud – the complete cloud analytics and data platform.

“Teradata VantageCloud Lake is the result of a multi-year journey to create a new paradigm for data and analytics – one where superior performance, agility, and value all go hand-in-hand. VantageCloud Enterprise – our established Vantage in the cloud offering – is the recognized price performance leader in the market.
Teradata VantageCloud Lake offers all of those same benefits in a package that is appealing to diverse functions and roles, opening up an entirely new market segment for us. With Teradata VantageCloud Lake, we now support all analytic workload needs at every level in the organization, enabling companies to be more nimble, experimental, and innovative in an easy-to-use solution without losing the governance and cost visibility that Teradata is known for.”

Hillary Ashton, Chief Product Officer at Teradata

Companies today are using Teradata to run business-critical workloads with strict service level agreements (SLAs) to meet core business needs, such as an airline having a reservation system up and running 24/7. As such, these workloads are highly governed by IT and sheltered from potential interference. The result is that new projects, like a mobile customer engagement application, can be very difficult to get started, especially new analytics projects that can consume unpredictable resources and put SLAs at risk. Departmental workloads and exploratory data science projects are often delayed or rejected, as enterprise workloads are prioritized. These new projects then drive the adoption of shadow systems on alternate technologies, but as these shadow systems proliferate, so do the costs and governance challenges for the organization. With the introduction of Teradata VantageCloud Lake, organizations have a greater ability to innovate by quickly spinning up ad hoc, exploratory, and departmental workloads, for example, leveraging open, connected data and gaining easy access to all of the other benefits Teradata offers with its flagship product. Organizations can maintain governance and provide flexible but controlled compute resources to the business.
With Teradata, customers using VantageCloud Lake or Enterprise Edition have a powerful, scalable, and sustainable cloud analytics and data platform that is designed to meet an enterprise’s growing need for a wide range of diversified use cases that spur innovation and advancement. Underpinning each of these offerings is Teradata’s industry-leading analytics capabilities, significantly expanded and re-launched today as ClearScape Analytics, which is designed to offer powerful, open, and connected analytics providing autonomy and ease of access to deliver real-time insights and optimize business results.

About Teradata
Teradata is the connected multi-cloud data platform for enterprise analytics company. Our enterprise analytics solve business challenges from start to scale. Only Teradata gives you the flexibility to handle the massive and mixed data workloads of the future, today.


BIG DATA MANAGEMENT

Cloud-based Solution Provider 2600Hz Chooses Pica8 For Their Data Centers

Pica8 | July 15, 2022

2600Hz is a telecom services provider delivering cloud communications solutions to MSPs, ISPs, and telecom resellers. 2600Hz's platform KAZOO modernizes how businesses provide communications services to their customers. Whether it be voice, mobile, video, fax or SMS, 2600Hz simplifies and opens the cryptic black box of telecom. 2600Hz is privately held and is based in the San Francisco Bay Area, with international offices. 2600Hz had standardized on Edgecore equipment with Cumulus Linux before NVIDIA acquired Cumulus and stopped supporting Broadcom. 2600Hz wanted to stay on an open networking platform to avoid lock-in. Upgrading to a Pica8 solution allowed 2600Hz to retain its hardware system of choice and provided a logical upgrade path for future expansion.

Products Used
After a short evaluation, 2600Hz initially upgraded a small cluster from Cumulus to PicOS, and over time has transitioned three sites to PicOS. Pica8 spine/leaf software switches are deployed using Edgecore Open Network Platform in three data centers: New Jersey, Chicago and Sunnyvale.

Deployment Notes
2600Hz uses PicOS in both Top-of-Rack (TOR) and aggregation switches. The use cases are standard data center networks where the TOR uses an MLAG configuration for high availability and the aggregation layer requires L3 routing. For the TOR layer, the MLAG must provide maximum LACP flexibility, including LACP fallback and fast mode, to enable PXE boot and fast failover handling for various servers.

When asked about the decision process of choosing Pica8 during the evaluation stage, Tyler Kiziah, DevOps Manager at 2600Hz, pointed out that support of open networking standards was crucial. "PicOS is a Linux-based NOS running on an unmodified Debian Linux. Orchestration with Ansible was the decisive factor in choosing Pica8. It is very similar to Juniper Junos. Anyone with Junos experience will easily make the transition to PicOS with minimal training. The migration itself was done easily, and usability was intuitive."

He highlighted the role of Pica8 Support in the transition process: "Pica8 Support has been extremely helpful to introduce features that the network depends on. LACP Fall Back is an example. This made the transition from Cumulus to Pica8 easier. PicOS integrates with 2600Hz's product offerings such as Self Sign-up; PicOS is very easy to use."

About Pica8
Pica8 is the industry's open networking software alternative to Cisco, Juniper and Arista for the enterprise. Pica8's AmpCon™ Network Controller for centralized management and automation and PicOS® Software Switches for networking and security have successfully replaced Cisco DNA Center and Catalyst Switches and competing Juniper and Arista solutions for campus, data center and distributed site networks within Fortune 500 enterprises. Pica8 software is deployed at over 1,000 customers in over 40 countries.
