Q&A with Sadiqah Musa, Co-Founder at Black In Data

Sadiqah Musa, Co-Founder at Black In Data, is also an experienced Senior Data Analyst at Guardian News and Media with a demonstrated history of working in the energy and publishing sectors. She is skilled in Advanced Excel, SQL, Python, data visualization, project management, and Data Analysis and has a strong professional background with a Master of Science (MSc) from The University of Manchester.

As people become more aware and concerned about their data, the need for a regulating body will emerge, and organizations’ accountability will be mandated.



MEDIA 7: Could you please tell us a little bit about yourself? What inspired you to pursue a career in Data Analytics?
SADIQAH MUSA:
My name is Sadiqah Musa, I am a Co-Founder of Black in Data, a movement I started to push for inclusivity and diversity in the Data Analytics Industry. My current role is as a Senior Analyst at Guardian. Data Analytics found me as  I originally started my career as a geophysicist, focusing on the exploration of the oil and gas industry and analyzing seismic data. When people ask me, what inspired me to pursue a career change to Data Analytics, I always give an honest and real answer. I didn’t have a sort of epiphany moment where I realized Data Analytics was the new future and therefore I needed to be part of that game, instead, it was simple. At the time when I was considering other career options, there was a huge bust in the oil and gas industry, this is fairly standard and part of the ‘Boom Year, Bust Year’ cycle, however, I wanted a career to give me more stability. Therefore, Data Analytics was this new stable career field, which also allowed me to bring my technical skills of analyzing seismic data for 8 years and general comfort with the skillset I had already had.


M7: What impact do you think analytics & big data will have in the next 5 years?
SM:
I think two major things will happen in the next 5 years: There will be a push in Machine Learning Algorithms and discussions around the Ethics of AI. In terms of Machine Learning Algorithms, this is something that is already happening, but not on a scalable platform/level. The platforms we currently use to create machine learning algorithms are open-source and free, but you will need additional expensive software to turn all of your algorithms into a real, tangible product. At the moment, I'm noticing a shift in this situation, with the software becoming more accessible. As a result, I believe there will be a greater shift in machine learning and natural language processing in the future, where the algorithms will recognize speech and perform functions independently. Discussions on the Ethics of AI are also being held, but as the AI industry continues to grow so will our concern about its morality. A particular area is the biases within the algorithms that people have created, this is an industry that is not being regulated. When we look at things like facial recognition data, we can see that millions of images of our data are being stored without our knowledge. As people become more aware and concerned about their data, the need for a regulating body will emerge, and organizations’ accountability will be mandated.


Some companies and sponsors appear as incredibly supportive and in alignment with our organization however, they fail to provide the financial support needed as well as offer their resources to push our organization further.



M7: What have been your two biggest challenges as a co-founder at Black in Data, and how did you tackle them?
SM:
Our biggest challenge has been a reluctance to follow through with the discussions regarding implementing change and diversify the workplace. Whilst we have inspired a lot of initial interest and support regarding our movement, the difficulty lies within having our sponsors or facilitating companies commit to tangible change and action. However, we have been working to resolve this and openly having these discussions with our proposed sponsors. Also, another challenge we face, as many non-profit organizations must face, relates to our finances. Some companies and sponsors appear as incredibly supportive and in alignment with our organization however, they fail to provide the financial support needed as well as offer their resources to push our organization further.


M7: You have recently partnered with CodeUntapped, Frocentric_tech and Google Digital Garage. Could you please tell us about some of the developments that all of you at Black in Data are excited about?
SM:
At Black, In Data we are most excited about our latest partnership with CodeUntapped. It is a partnership where we get people from BID, put them into a training scheme with CodeUntapped and then put them into Jobs with amazing Organisations. The training scheme lasts for 6 weeks, is completely paid and by the end of the training scheme, the candidates are placed into the physical role that they have been training for. Once contracted for roughly 9 to 12 months, training continues. The usual outcome is securing a permanent role with the company. With Frocentric, this is an exciting person of the color networking data-industry site. It strives to connect all the different tech sectors to one platform where we can all network and support each other. It ranges from Black Founder of Data Science, AI Companies and Data companies and bringing them together to break down the compartmentalized and fractured industry. Finally, Google Digital Garage is an incredible platform that enables us to provide soft skills training ranging from CV writing to Google Digital analytics.


Traditionally people used to consume their news via structured platforms, however social media is allowing you to get the raw content from the people that are being the most impacted.



M7: What do you see as the most noticeable change right now happening in the workforce, encouraged by the rise of social media and digital technologies?
SM: 
The most noticeable change happening in the workforce is down to the impact of social media being used as a tool for awareness and awakening. Traditionally people used to consume their news via structured platforms, however social media is allowing you to get the raw content from the people that are being the most impacted. This has been fantastic for people of color, trying to educate on their lived experiences, share them and finally bringing it to the fore so that organizations have to address it. Topics typically neglected as well as deemed taboo have now been brought to light, forcing companies to evaluate their workplace treatments and biases.


M7: What is your advice to the young women who are starting out their careers in Data Analytics?
SM:
The most important thing to do is ensure that you have all of the necessary technical skills, such as being extremely proficient in Statistics, which is a critical component of Data Analytics. One thing I was never told about that I now realize is the significance of, is being able to do storytelling with Data. Usually, you might just think analyzing your data and presenting your findings is enough but you need to master how to explain your findings very well. It is also easy to second-guess yourself, so you need to be confident in your ability and trust yourself. Ultimately, I’d say be kind to yourself. Growing up, I was exceptionally scared of failure but it is all those mistakes that have shaped me into the analyst that I am today!

ABOUT BLACK IN DATA

Black in Data is a collaborative movement for people of colour, striving to promote equality of opportunity and representation within the data industry. Diversity and equality at work is crucial not only for social justice, but also for the development of a flourishing data and technology industry, able to adapt in a dynamic world. The face of the workforce is changing, and  every industry benefits from the greater breadth of knowledge, skills and personal experience that this greater diversity brings. Black in Data aims to energise and accelerate this pace of change.

More THOUGHT LEADERS

Q&A with Collette Johnson, Product Marketing Specialist at Redgate Software

Media 7 | January 12, 2021

Collette Johnson, Product Marketing Specialist at Redgate Software, serves as the key player in the marketing success of the organization. She is known for always delivering the highest standards of work for the benefit of the businesses she works for. MEDIA 7: Congratulations on being listed as one of the ‘Top 100 Product Marketing Influencers 2020’! What inspired you to get into marketing? COLLETTE JOHNSON: For me, it was all about the customer. Very early on in my career, I realized that enabling organizations to embrace new technology to drive their successes excited me, and this is what inspired me towards a career in marketing. I’m really passionate about understanding how to engage people, learning what messaging and narratives help them in making decisions, and how to enable sales to succeed through the materials we produce. I thoroughly enjoy the learning that you get in marketing and how everything you do is different as you are testing and iterating for the best outcome for customers....

Read More

Q&A with Alastair Speare-Cole, President of Insurance at QOMPLX

Media 7 | August 20, 2021

Alastair Speare-Cole, President and General Manager of the Insurance Division at QOMPLX, leads the overall strategy for the business unit, the development of QOMPLX’s underwriting-as-a-service platform, the management of the company’s Managing General Agent (MGA), as well as setting the direction for the company’s next-generation insurance decision platform that leverages a wide variety of data and advanced analytics to provide advanced risk and portfolio management solutions. Prior to joining QOMPLX, he served as Chief Underwriting Officer at Qatar, and he served as the CEO of JLT Towers from 2012 to 2015. He was also COO at Aon Re for ten years and has also held board appointments at reinsurance and banking subsidiaries in the United Kingdom....

Read More

Q&A with Vishal Srivastava, Vice President (Model Validation) at Citi

Media 7 | September 8, 2021

Vishal Srivastava, Vice President (Model Validation) at Citi was invited as a keynote speaker to present on Fraud Analytics using Machine Learning at the International Automation in Banking Summit in New York in November 2019. Vishal has experience in quantitative risk modeling using advanced engineering, statistical, and machine learning technologies. His academic qualifications in combination with a Ph.D. in Chemical Engineering and an MBA in Finance have enabled him to challenge quantitative risk models with scientific rigor. Vishal’s doctoral thesis included the development of statistical and machine learning-based risk models—some of which are currently being used commercially. Vishal has 120+ peer-reviewed citations in areas such as risk management, quantitative modeling, machine learning, and predictive analytics....

Read More

Q&A with Collette Johnson, Product Marketing Specialist at Redgate Software

Media 7 | January 12, 2021

Collette Johnson, Product Marketing Specialist at Redgate Software, serves as the key player in the marketing success of the organization. She is known for always delivering the highest standards of work for the benefit of the businesses she works for. MEDIA 7: Congratulations on being listed as one of the ‘Top 100 Product Marketing Influencers 2020’! What inspired you to get into marketing? COLLETTE JOHNSON: For me, it was all about the customer. Very early on in my career, I realized that enabling organizations to embrace new technology to drive their successes excited me, and this is what inspired me towards a career in marketing. I’m really passionate about understanding how to engage people, learning what messaging and narratives help them in making decisions, and how to enable sales to succeed through the materials we produce. I thoroughly enjoy the learning that you get in marketing and how everything you do is different as you are testing and iterating for the best outcome for customers....

Read More

Q&A with Alastair Speare-Cole, President of Insurance at QOMPLX

Media 7 | August 20, 2021

Alastair Speare-Cole, President and General Manager of the Insurance Division at QOMPLX, leads the overall strategy for the business unit, the development of QOMPLX’s underwriting-as-a-service platform, the management of the company’s Managing General Agent (MGA), as well as setting the direction for the company’s next-generation insurance decision platform that leverages a wide variety of data and advanced analytics to provide advanced risk and portfolio management solutions. Prior to joining QOMPLX, he served as Chief Underwriting Officer at Qatar, and he served as the CEO of JLT Towers from 2012 to 2015. He was also COO at Aon Re for ten years and has also held board appointments at reinsurance and banking subsidiaries in the United Kingdom....

Read More

Q&A with Vishal Srivastava, Vice President (Model Validation) at Citi

Media 7 | September 8, 2021

Vishal Srivastava, Vice President (Model Validation) at Citi was invited as a keynote speaker to present on Fraud Analytics using Machine Learning at the International Automation in Banking Summit in New York in November 2019. Vishal has experience in quantitative risk modeling using advanced engineering, statistical, and machine learning technologies. His academic qualifications in combination with a Ph.D. in Chemical Engineering and an MBA in Finance have enabled him to challenge quantitative risk models with scientific rigor. Vishal’s doctoral thesis included the development of statistical and machine learning-based risk models—some of which are currently being used commercially. Vishal has 120+ peer-reviewed citations in areas such as risk management, quantitative modeling, machine learning, and predictive analytics....

Read More

Related News

Big Data Management

Dremio Partners With Carahsoft to Bring Modern Data Infrastructure to the Public Sector

Dremio | January 05, 2024

Dremio, the easy and open data lakehouse, and Carahsoft Technology Corp., The Trusted Government IT Solutions Provider, today announced a partnership. Under the agreement, Carahsoft will serve as Dremio’s Master Government Aggregator®, making the company’s complete cloud and software portfolio for Government, Defense, Intelligence and Education available through Carahsoft’s reseller partners and NASA Solutions for Enterprise-Wide Procurement (SEWP) V, Information Technology Enterprise Solutions – Software 2 (ITES-SW2), National Association of State Procurement Officials (NASPO) ValuePoint, National Cooperative Purchasing Alliance (NCPA) and OMNIA Partners contracts. This collaboration paves the way for Public Sector organizations to harness cutting-edge data analytics capabilities which empower them to make smarter decisions and significantly enhance operational efficiency through lightning-fast data access. Dremio propels agencies into the future by embracing a state-of-the-art data lakehouse architecture in Public Sector organizations. By transitioning to Dremio's solutions, organizations can enjoy sub-second query performance and a remarkable 10-fold improvement in price performance. The new environment eliminates costly and complex legacy data lake solutions and implements a flexible, highly modern architecture. Dremio provides cost-effective self-service analytics and data management, simplifying data pipelines and ETL complexity while accelerating insights across diverse storage locations. With a proven track record of modernizing legacy Hadoop environments and implementing the modern data lakehouse solution, Dremio and Carahsoft aim to transform Public Sector data management by breaking down data silos. Embracing a data mesh concept enables efficient handling of various data sources, fostering cross- collaboration and decentralized data control. Dremio’s expertise supports Public Sector agencies in implementing these principles, eradicating data silos for a more collaborative and efficient approach to data analytics. "Public sector organizations face a range of data infrastructure challenges, many of which are common to both Government and non-government entities. These issues often hinder effective data management, analysis, and decision-making,” said Roger Frey, Vice President of Alliances at Dremio. “Dremio's mission is to make data easily accessible and analyzable for all users, regardless of where it resides. We are excited to partner with Carahsoft to bring our state-of-the-art data analytics solutions to the Public Sector." “Within the Public Sector’s intricate data landscape, complexities often impede efficient data management and decision making,” said Laura Howton, Sales Director who leads the Analytics and Data Management Team at Carahsoft. “By adding Dremio to our AI and Machine Learning portfolio, our reseller partners can now provide modern, cost-effective and easily accessible data analytics tools to Government customers, bolstering their modernization efforts.” Dremio’s solutions are available through Carahsoft’s SEWP V contracts NNG15SC03B and NNG15SC27B, ITES-SW2 Contract W52P1J-20-D-0042, NASPO ValuePoint Master Agreement #AR2472, NCPA Contract NCPA01-86 and OMNIA Partners Contract #R191902. For more information, contact the Carahsoft team at 571-591-6430 or Dremio@carahsoft.com; or register for a complimentary webcast, “The Key Steps in Decreasing Costs and Improving Performance Through Data Lake Modernization,” Thursday, January 25, at 2p.m. About Dremio Dremio is the easy and open data lakehouse, providing self-service analytics with data warehouse functionality and data lake flexibility across all of your data. Use Dremio's lightning-fast SQL query service and any other processing engine on the same data. Dremio increases agility with a revolutionary data-as-code approach that enables Git-like data experimentation, version control, and governance. In addition, Dremio eliminates data silos by enabling queries across data lakes, databases, and data warehouses, and by simplifying ingestion into the lakehouse. Dremio's fully managed service helps organizations get started with analytics in minutes, and automatically optimizes data for every workload. As the original creator of Apache Arrow and committed to Arrow and Iceberg’s community-driven standards, Dremio is on a mission to reinvent SQL for data lakes and meet customers where they are on their lakehouse journey.

Read More

Machine Learning

InterSystems Introduces Two New Cloud-Native Smart Data Services to Accelerate Database and Machine Learning Application Development

InterSystems | January 12, 2024

InterSystems, a creative data technology provider dedicated to helping customers solve their most critical scalability, interoperability, and speed problems, today announced general availability of the InterSystems IRIS Cloud SQL and InterSystems IRIS Cloud IntegratedML® services. These fully managed cloud-native smart data services empower developers to build cloud-native database and machine learning (ML) applications in SQL environments with ease. With Cloud SQL and Cloud IntegratedML, developers can access a next generation relational database-as-a-service (DBaaS) that is fast and easy to provision and use. Embedded AutoML capabilities allow developers to easily develop and execute machine learning models with just a few SQL-like commands in a fully-managed, elastic cloud-native environment. Shaping a complete data management portfolio for mission-critical applications As part of the InterSystems Cloud portfolio of smart data services, Cloud SQL and Cloud IntegratedML provide application developers with access to InterSystems proven enterprise-class capabilities as self-service, fully managed offerings on Amazon Web Services (AWS), while providing a fast and seamless on-ramp to the full suite of capabilities in InterSystems IRIS® data platform. InterSystems IRIS is a next-generation data platform designed for organizations implementing smart data fabrics that provide powerful database management, integration, and application development capabilities. By consolidating these capabilities into a single product, InterSystems IRIS accelerates the time it takes to realize value from data, simplifies overall system architectures, and reduces both maintenance effort and costs. “We are excited for the capabilities of InterSystems IRIS to be exposed through these new, easy to deploy and easy to use services,” said Scott Gnau, Global Head of Data Platforms at InterSystems. “With native support for AutoML, we give developers the power to build comprehensive, predictive, and prescriptive applications.” Fully managed, enterprise-class reliability with InterSystems IRIS Cloud SQL Cloud SQL makes it easy for application developers to leverage advanced relational database capabilities as a fully managed, secure, scalable, high performance, highly available cloud-native database-as-a-service (DBaaS). Cloud SQL delivers the following benefits for SQL developers: Extremely high performance, especially for ingesting and processing incoming data and performing SQL queries on the data with low latency at scale Fast and easy to provision and use Ability to easily connect client applications via JDBC, ODBC, DB-API, and ADO.NET drivers Automated security, data encryption, and backups Automation of machine learning tasks with InterSystems IRIS Cloud IntegratedML Available as an additional cloud managed service for InterSystems IRIS Cloud SQL customers, Cloud IntegratedML extends the capabilities of Cloud SQL to enable SQL developers to quickly build, tune, and execute machine learning models with just a few SQL-like commands, without moving or copying data to a different environment. A significant advantage of Cloud IntegratedML is the elimination of the need to transfer or replicate data to an external platform to build ML models, or to move ML models to a different environment for execution. Cloud IntegratedML delivers the following benefits for SQL developers: Automation of machine learning processes and resource-intensive tasks such as feature engineering, model development, and fine-tuning Seamless integration of models developed and trained with Cloud IntegratedML within Cloud SQL, facilitating real-time predictive insights and prescriptive actions in response to events and transactions This comprehensive suite of smart data services establishes the InterSystems Cloud portfolio of smart data services as an optimal choice for SQL developers seeking a robust, high-performance database solution tailored to their needs. The new Cloud SQL and Cloud IntegratedML services are available through InterSystems Developer Hub. About InterSystems Established in 1978, InterSystems is the leading provider of next-generation solutions for enterprise digital transformations in the healthcare, finance, manufacturing, and supply chain sectors. Its cloud-first data platforms solve interoperability, speed, and scalability problems for large organizations around the globe. InterSystems is committed to excellence through its award-winning, 24×7 support for customers and partners in more than 80 countries. Privately held and headquartered in Cambridge, Massachusetts, InterSystems has 38 offices in 28 countries worldwide. For more information, please visit InterSystems.com.

Read More

Big Data Management

data.world Integrates with Snowflake Data Quality Metrics to Bolster Data Trust

data.world | January 24, 2024

data.world, the data catalog platform company, today announced an integration with Snowflake, the Data Cloud company, that brings new data quality metrics and measurement capabilities to enterprises. The data.world Snowflake Collector now empowers enterprise data teams to measure data quality across their organization on-demand, unifying data quality and analytics. Customers can now achieve greater trust in their data quality and downstream analytics to support mission-critical applications, confident data-driven decision-making, and AI initiatives. Data quality remains one of the top concerns for chief data officers and a critical barrier to creating a data-driven culture. Traditionally, data quality assurance has relied on manual oversight – a process that’s tedious and fraught with inefficacy. The data.world Data Catalog Platform now delivers Snowflake data quality metrics directly to customers, streamlining quality assurance timelines and accelerating data-first initiatives. Data consumers can access contextual information in the catalog or directly within tools such as Tableau and PowerBI via Hoots – data.world’s embedded trust badges – that broadcast data health status and catalog context, bolstering transparency and trust. Additionally, teams can link certification and DataOps workflows to Snowflake's data quality metrics to automate manual workflows and quality alerts. Backed by a knowledge graph architecture, data.world provides greater insight into data quality scores via intelligence on data provenance, usage, and context – all of which support DataOps and governance workflows. “Data trust is increasingly crucial to every facet of business and data teams are struggling to verify the quality of their data, facing increased scrutiny from developers and decision-makers alike on the downstream impacts of their work, including analytics – and soon enough, AI applications,” said Jeff Hollan, Director, Product Management at Snowflake. “Our collaboration with data.world enables data teams and decision-makers to verify and trust their data’s quality to use in mission-critical applications and analytics across their business.” “High-quality data has always been a priority among enterprise data teams and decision-makers. As enterprise AI ambitions grow, the number one priority is ensuring the data powering generative AI is clean, consistent, and contextual,” said Bryon Jacob, CTO at data.world. “Alongside Snowflake, we’re taking steps to ensure data scientists, analysts, and leaders can confidently feed AI and analytics applications data that delivers high-quality insights, and supports the type of decision-making that drives their business forward.” The integration builds on the robust collaboration between data.world and Snowflake. Most recently, the companies announced an exclusive offering for joint customers, streamlining adoption timelines and offering a new attractive price point. The data.world's knowledge graph-powered data catalog already offers unique benefits for Snowflake customers, including support for Snowpark. This offering is now available to all data.world enterprise customers using the Snowflake Collector, as well as customers taking advantage of the Snowflake-only offering. To learn more about the data quality integration or the data.world data catalog platform, visit data.world. About data.world data.world is the data catalog platform built for your AI future. Its cloud-native SaaS (software-as-a-service) platform combines a consumer-grade user experience with a powerful Knowledge Graph to deliver enhanced data discovery, agile data governance, and actionable insights. data.world is a Certified B Corporation and public benefit corporation and home to the world’s largest collaborative open data community with more than two million members, including ninety percent of the Fortune 500. Our company has 76 patents and has been named one of Austin’s Best Places to Work seven years in a row.

Read More

Big Data Management

Dremio Partners With Carahsoft to Bring Modern Data Infrastructure to the Public Sector

Dremio | January 05, 2024

Dremio, the easy and open data lakehouse, and Carahsoft Technology Corp., The Trusted Government IT Solutions Provider, today announced a partnership. Under the agreement, Carahsoft will serve as Dremio’s Master Government Aggregator®, making the company’s complete cloud and software portfolio for Government, Defense, Intelligence and Education available through Carahsoft’s reseller partners and NASA Solutions for Enterprise-Wide Procurement (SEWP) V, Information Technology Enterprise Solutions – Software 2 (ITES-SW2), National Association of State Procurement Officials (NASPO) ValuePoint, National Cooperative Purchasing Alliance (NCPA) and OMNIA Partners contracts. This collaboration paves the way for Public Sector organizations to harness cutting-edge data analytics capabilities which empower them to make smarter decisions and significantly enhance operational efficiency through lightning-fast data access. Dremio propels agencies into the future by embracing a state-of-the-art data lakehouse architecture in Public Sector organizations. By transitioning to Dremio's solutions, organizations can enjoy sub-second query performance and a remarkable 10-fold improvement in price performance. The new environment eliminates costly and complex legacy data lake solutions and implements a flexible, highly modern architecture. Dremio provides cost-effective self-service analytics and data management, simplifying data pipelines and ETL complexity while accelerating insights across diverse storage locations. With a proven track record of modernizing legacy Hadoop environments and implementing the modern data lakehouse solution, Dremio and Carahsoft aim to transform Public Sector data management by breaking down data silos. Embracing a data mesh concept enables efficient handling of various data sources, fostering cross- collaboration and decentralized data control. Dremio’s expertise supports Public Sector agencies in implementing these principles, eradicating data silos for a more collaborative and efficient approach to data analytics. "Public sector organizations face a range of data infrastructure challenges, many of which are common to both Government and non-government entities. These issues often hinder effective data management, analysis, and decision-making,” said Roger Frey, Vice President of Alliances at Dremio. “Dremio's mission is to make data easily accessible and analyzable for all users, regardless of where it resides. We are excited to partner with Carahsoft to bring our state-of-the-art data analytics solutions to the Public Sector." “Within the Public Sector’s intricate data landscape, complexities often impede efficient data management and decision making,” said Laura Howton, Sales Director who leads the Analytics and Data Management Team at Carahsoft. “By adding Dremio to our AI and Machine Learning portfolio, our reseller partners can now provide modern, cost-effective and easily accessible data analytics tools to Government customers, bolstering their modernization efforts.” Dremio’s solutions are available through Carahsoft’s SEWP V contracts NNG15SC03B and NNG15SC27B, ITES-SW2 Contract W52P1J-20-D-0042, NASPO ValuePoint Master Agreement #AR2472, NCPA Contract NCPA01-86 and OMNIA Partners Contract #R191902. For more information, contact the Carahsoft team at 571-591-6430 or Dremio@carahsoft.com; or register for a complimentary webcast, “The Key Steps in Decreasing Costs and Improving Performance Through Data Lake Modernization,” Thursday, January 25, at 2p.m. About Dremio Dremio is the easy and open data lakehouse, providing self-service analytics with data warehouse functionality and data lake flexibility across all of your data. Use Dremio's lightning-fast SQL query service and any other processing engine on the same data. Dremio increases agility with a revolutionary data-as-code approach that enables Git-like data experimentation, version control, and governance. In addition, Dremio eliminates data silos by enabling queries across data lakes, databases, and data warehouses, and by simplifying ingestion into the lakehouse. Dremio's fully managed service helps organizations get started with analytics in minutes, and automatically optimizes data for every workload. As the original creator of Apache Arrow and committed to Arrow and Iceberg’s community-driven standards, Dremio is on a mission to reinvent SQL for data lakes and meet customers where they are on their lakehouse journey.

Read More

Machine Learning

InterSystems Introduces Two New Cloud-Native Smart Data Services to Accelerate Database and Machine Learning Application Development

InterSystems | January 12, 2024

InterSystems, a creative data technology provider dedicated to helping customers solve their most critical scalability, interoperability, and speed problems, today announced general availability of the InterSystems IRIS Cloud SQL and InterSystems IRIS Cloud IntegratedML® services. These fully managed cloud-native smart data services empower developers to build cloud-native database and machine learning (ML) applications in SQL environments with ease. With Cloud SQL and Cloud IntegratedML, developers can access a next generation relational database-as-a-service (DBaaS) that is fast and easy to provision and use. Embedded AutoML capabilities allow developers to easily develop and execute machine learning models with just a few SQL-like commands in a fully-managed, elastic cloud-native environment. Shaping a complete data management portfolio for mission-critical applications As part of the InterSystems Cloud portfolio of smart data services, Cloud SQL and Cloud IntegratedML provide application developers with access to InterSystems proven enterprise-class capabilities as self-service, fully managed offerings on Amazon Web Services (AWS), while providing a fast and seamless on-ramp to the full suite of capabilities in InterSystems IRIS® data platform. InterSystems IRIS is a next-generation data platform designed for organizations implementing smart data fabrics that provide powerful database management, integration, and application development capabilities. By consolidating these capabilities into a single product, InterSystems IRIS accelerates the time it takes to realize value from data, simplifies overall system architectures, and reduces both maintenance effort and costs. “We are excited for the capabilities of InterSystems IRIS to be exposed through these new, easy to deploy and easy to use services,” said Scott Gnau, Global Head of Data Platforms at InterSystems. “With native support for AutoML, we give developers the power to build comprehensive, predictive, and prescriptive applications.” Fully managed, enterprise-class reliability with InterSystems IRIS Cloud SQL Cloud SQL makes it easy for application developers to leverage advanced relational database capabilities as a fully managed, secure, scalable, high performance, highly available cloud-native database-as-a-service (DBaaS). Cloud SQL delivers the following benefits for SQL developers: Extremely high performance, especially for ingesting and processing incoming data and performing SQL queries on the data with low latency at scale Fast and easy to provision and use Ability to easily connect client applications via JDBC, ODBC, DB-API, and ADO.NET drivers Automated security, data encryption, and backups Automation of machine learning tasks with InterSystems IRIS Cloud IntegratedML Available as an additional cloud managed service for InterSystems IRIS Cloud SQL customers, Cloud IntegratedML extends the capabilities of Cloud SQL to enable SQL developers to quickly build, tune, and execute machine learning models with just a few SQL-like commands, without moving or copying data to a different environment. A significant advantage of Cloud IntegratedML is the elimination of the need to transfer or replicate data to an external platform to build ML models, or to move ML models to a different environment for execution. Cloud IntegratedML delivers the following benefits for SQL developers: Automation of machine learning processes and resource-intensive tasks such as feature engineering, model development, and fine-tuning Seamless integration of models developed and trained with Cloud IntegratedML within Cloud SQL, facilitating real-time predictive insights and prescriptive actions in response to events and transactions This comprehensive suite of smart data services establishes the InterSystems Cloud portfolio of smart data services as an optimal choice for SQL developers seeking a robust, high-performance database solution tailored to their needs. The new Cloud SQL and Cloud IntegratedML services are available through InterSystems Developer Hub. About InterSystems Established in 1978, InterSystems is the leading provider of next-generation solutions for enterprise digital transformations in the healthcare, finance, manufacturing, and supply chain sectors. Its cloud-first data platforms solve interoperability, speed, and scalability problems for large organizations around the globe. InterSystems is committed to excellence through its award-winning, 24×7 support for customers and partners in more than 80 countries. Privately held and headquartered in Cambridge, Massachusetts, InterSystems has 38 offices in 28 countries worldwide. For more information, please visit InterSystems.com.

Read More

Big Data Management

data.world Integrates with Snowflake Data Quality Metrics to Bolster Data Trust

data.world | January 24, 2024

data.world, the data catalog platform company, today announced an integration with Snowflake, the Data Cloud company, that brings new data quality metrics and measurement capabilities to enterprises. The data.world Snowflake Collector now empowers enterprise data teams to measure data quality across their organization on-demand, unifying data quality and analytics. Customers can now achieve greater trust in their data quality and downstream analytics to support mission-critical applications, confident data-driven decision-making, and AI initiatives. Data quality remains one of the top concerns for chief data officers and a critical barrier to creating a data-driven culture. Traditionally, data quality assurance has relied on manual oversight – a process that’s tedious and fraught with inefficacy. The data.world Data Catalog Platform now delivers Snowflake data quality metrics directly to customers, streamlining quality assurance timelines and accelerating data-first initiatives. Data consumers can access contextual information in the catalog or directly within tools such as Tableau and PowerBI via Hoots – data.world’s embedded trust badges – that broadcast data health status and catalog context, bolstering transparency and trust. Additionally, teams can link certification and DataOps workflows to Snowflake's data quality metrics to automate manual workflows and quality alerts. Backed by a knowledge graph architecture, data.world provides greater insight into data quality scores via intelligence on data provenance, usage, and context – all of which support DataOps and governance workflows. “Data trust is increasingly crucial to every facet of business and data teams are struggling to verify the quality of their data, facing increased scrutiny from developers and decision-makers alike on the downstream impacts of their work, including analytics – and soon enough, AI applications,” said Jeff Hollan, Director, Product Management at Snowflake. “Our collaboration with data.world enables data teams and decision-makers to verify and trust their data’s quality to use in mission-critical applications and analytics across their business.” “High-quality data has always been a priority among enterprise data teams and decision-makers. As enterprise AI ambitions grow, the number one priority is ensuring the data powering generative AI is clean, consistent, and contextual,” said Bryon Jacob, CTO at data.world. “Alongside Snowflake, we’re taking steps to ensure data scientists, analysts, and leaders can confidently feed AI and analytics applications data that delivers high-quality insights, and supports the type of decision-making that drives their business forward.” The integration builds on the robust collaboration between data.world and Snowflake. Most recently, the companies announced an exclusive offering for joint customers, streamlining adoption timelines and offering a new attractive price point. The data.world's knowledge graph-powered data catalog already offers unique benefits for Snowflake customers, including support for Snowpark. This offering is now available to all data.world enterprise customers using the Snowflake Collector, as well as customers taking advantage of the Snowflake-only offering. To learn more about the data quality integration or the data.world data catalog platform, visit data.world. About data.world data.world is the data catalog platform built for your AI future. Its cloud-native SaaS (software-as-a-service) platform combines a consumer-grade user experience with a powerful Knowledge Graph to deliver enhanced data discovery, agile data governance, and actionable insights. data.world is a Certified B Corporation and public benefit corporation and home to the world’s largest collaborative open data community with more than two million members, including ninety percent of the Fortune 500. Our company has 76 patents and has been named one of Austin’s Best Places to Work seven years in a row.

Read More

Spotlight

Black In Data

Black in Data is a collaborative movement for people of colour, striving to promote equality of opportunity and representation within the data industry. Diversity and equality at work is crucial not only for social justice, but also for the development of a flourishing data and technology industry, ...

Events

Resources