Q&A with Wendy Gonzalez, CEO at Sama

Media 7 | September 17, 2021

Wendy Gonzalez, CEO at Sama, is an executive who is passionate about building high-performing, high-functioning teams that develop and scale innovative, impactful technology. She has two decades of managerial and technology leadership experience for companies including EY, Capgemini, Cycle30 (acquired by Arrow Electronics) and General Communications Inc. She is an active Board Member of the Leila Janah Foundation.

As artificial intelligence continues to become globally adopted, it’s vital that emerging technologies are built on data that’s accurate, diverse, free of bias, and ethically sourced.



MEDIA 7: Could you please take us through your professional journey?
WENDY GONZALEZ:
Of course! I have spent the last two decades gaining managerial and technology leadership experience through my work for companies including Ernst and Young, Capgemini, and General Communications Inc. I also co-founded Cycle30 where we were developing SaaS solutions for Internet of Things (IoT) devices prior to joining Sama six years ago.What drew me to Sama was not only our best-in-class training data solutions that enable businesses to build innovative AI models, but also our strong social impact mission. After serving as President and COO of Sama since 2018, I stepped into the role of Interim CEO when my friend, Sama’s CEO and founder, Leila Janah, passed away in January of 2020. This past December, I was officially named CEO.


M7: For the second time, Sama made the Inc. 5000 List of America’s Fastest-Growing Private Companies! What was the strategy behind achieving such prestigious recognition in the industry?
WG:
I think the result of our recognition on the Inc. 5000 list is two-fold. First, our team at Sama is one of the hardest-working teams in the industry. Each day, we’re diligently working to ensure that we are providing the highest-quality training data for the world’s leading companies such as Walmart, Google, and NVIDIA. The second factor is our unending dedication to innovation. At Sama, we are passionate about building a work environment that fosters experimentation. That’s why we host an annual ‘Innovation Week,’ where we provide employees with the resources they need to work on new projects and try new things. Last year, over 50% of all Innovation Week projects were incorporated into our roadmap and implemented within the following two quarters. Our MicroModel™ technology was actually born from one of these projects, and today helps our clients get higher quality training data in half the time. I am also excited to share that in addition to being recognized on the Inc. 5000 list of one of America’s fastest-growing private companies for the second year in a row, we’ve also been honored with an inclusion in Fast Company’s World Changing Ideas and Forbes AI 50 lists. Additionally, we recently completed our three-year Randomized Controlled Trial with MIT which validated our social impact model that has helped over 55,000 people lift themselves out of poverty through our skills training and AI work.


From helping doctors diagnose cancer to providing traders with more accurate predictions of the stock market, AI can be an incredibly powerful tool in boosting efficiencies across industries, but only if it’s built on training data that’s accurate, diverse, bias-free, and ethically sourced.



M7: What is Sama’s MicroModel technology and how does it automate data annotation?
WG:
Our MicroModel™ technology is a powerful combination of machine learning-assisted annotation and ethical human validation that consistently produces data that’s between 94% and 98% accurate. First, our Machine Learning Assisted Annotation (MAA) powered by MicroModels™ draws from a library of models trained on specific use cases to expedite the labeling process by generating high-quality pre-labeled annotations. Next, our skilled annotators validate those labels instead of annotating from scratch, maximizing efficiency and producing higher quality training data in half the time. While our models are pre-trained on specific use cases, our human-in-the-loop validation helps us improve our models over time. Through the feedback from our skilled annotators, our models constantly learn to make better predictions. The end result for our clients is ultimately a significantly shorter, and higher quality path to production.


M7: What do you see as the most noticeable change right now happening in the workforce, encouraged by the rise of digital technologies?
WG: 
From helping doctors diagnose cancer to providing traders with more accurate predictions of the stock market, AI can be an incredibly powerful tool in boosting efficiencies across industries, but only if it’s built on training data that’s accurate, diverse, bias-free, and ethically sourced. The truth is 8 out of 10 machine learning (ML) projects fail, and 96% run into data quality and labeling problems which can produce inaccurate algorithms and may result in ethical, legal, and safety issues. At Sama, we understand the importance of having accurate, diverse, and bias-free training data to fuel this next generation of AI technology. When it comes to developing AI models, there’s no room to cut corners. We make producing accurate data our number one priority and are constantly innovating in order to provide our customers with the highest quality results to bring their AI models to production efficiently without sacrificing quality.


There are a number of challenges in the AI industry in the COVID-19 era. The first is keeping up with the demand for AI technology.



M7:  What do you believe are the top three challenges in the industry in the post-COVID-19 era?
WG:
There are a number of challenges in the AI industry in the COVID-19 era. The first is keeping up with the demand for AI technology. During the pandemic, we saw a dramatic shift as companies quickly adopted digital technology in order to boost efficiencies, cut costs, and keep up with competitors. This rapid digital transformation has in turn caused the artificial intelligence market to skyrocket. In 2020, the industry was estimated to be worth over $62 billion dollars, that number is expected to jump to more than $93 billion this year. As AI becomes incorporated into every industry, government organization, and virtually every aspect of our lives, companies are under extreme pressure to bring their AI models to production faster to keep up with competitors.

The second challenge is ensuring emerging technologies are built on a foundation of accurate data. Data quality plays an instrumental role in a machine learning model’s performance. Quality does not just mean the difference between your AI project succeeding or failing, but in some cases means the difference between life or death. Imagine, for example, that a self-driving car was built on data that didn’t recognize people, just stop signs? Creating quality data is one of the most expensive and time-consuming parts of building an AI model but it’s also the most crucial. At Sama, we understand accurate and quality data is important at every step of the machine learning process. The definition of what “quality” means however varies depending on each project, that’s why we always start by working with our clients to first define the meaning of quality. Through this process we are able to anticipate and address errors before the annotation process even begins, saving our clients time and money.

The third challenge is knowing what to automate and what not to automate. As we continue to boost efficiencies, it’s important to recognize that technology isn’t always the most effective way to streamline production. What makes our MicroModel™ technology so effective at consistently producing data that’s between 94% and 98% accurate is our human-in-the-loop validation. Our skilled annotators can recognize errors and inconsistencies that would otherwise have led to inaccurate data if left to automation alone. At Sama, we recognize the importance of having a high-performing and innovative team, which is why we’ve been so efficient at providing our customers with the highest quality data.


M7:  Being a business leader, how do you prepare for an AI-centric world?
WG:
In many ways, an AI-centric world is already here. As artificial intelligence continues to become globally adopted, it’s vital that emerging technologies are built on data that’s accurate, diverse, free of bias, and ethically sourced. As a leader in AI, I’m preparing for an AI-centric world by doing what we do best at Sama, which is building a high-performing team that develops and scales innovative, impactful technology for some of the world’s most ambitious AI companies while continuing to advocate for an ethical AI supply chain.

ABOUT SAMA

Sama provides high-quality training data that powers AI technology. The company’s platform is trusted by the world’s most ambitious organizations to develop accurate machine learning models. Used by leading technology companies such as Google, NVIDIA, GM, and Walmart, Sama specializes in image, video, language, and sensor data annotation and validation for machine learning algorithms in industries including manufacturing and robotics, bio and MedTech, autonomous vehicles, entertainment, e-commerce, retail, and agriculture. For more information, visit www.sama.com.

More C-Suite on deck

Q&A with Ben Lamm, Founder and CEO at Hypergiant

Media 7 | February 23, 2021

Ben Lamm, Founder and CEO at Hypergiant, has been launching and building disruptive businesses for over a decade. Prior to Hypergiant, he was the CEO and founder of Conversable, the leading conversational intelligence platform that helps brands reach their customers through automated experiences on all major voice and text platforms. In addition to leading and growing his companies, he is active in angel investing, incubators, and startup communities, with investments in the software and emerging tech space. He has served as an active voice in the tech community, as well as an active mentor to fellow entrepreneurs on how to build disruptive businesses through accelerators and corporate programs.

Read More

‘Data teams are critical in defining and driving business growth metrics’ says Gaurav Rewari, CEO of Mode.

Media 7 | March 4, 2022

Gaurav Rewari, CEO of Mode elaborates on his role as a CEO of Mode Analytics, the most comprehensive platform for collaborative Business Intelligence and Interactive Data Science. Read on to know more about his thoughts on digitization and Mode's brand-new visualization tool, Visual Explorer.

Read More

'Raising the voices of those who may not always be heard is critical,' says Claire Thomas

Media 7 | April 28, 2023

Claire Thomas is responsible for developing and implementing a strategy for diversity, equity, and inclusion (DEI) across Hitachi Vantara through programs that reflect the diverse backgrounds, interests, and passions of their current and future workforce. Continue reading to learn her views on the significance of inclusion and diversity in an organization.

Read More

Q&A with Ben Lamm, Founder and CEO at Hypergiant

Media 7 | February 23, 2021

Ben Lamm, Founder and CEO at Hypergiant, has been launching and building disruptive businesses for over a decade. Prior to Hypergiant, he was the CEO and founder of Conversable, the leading conversational intelligence platform that helps brands reach their customers through automated experiences on all major voice and text platforms. In addition to leading and growing his companies, he is active in angel investing, incubators, and startup communities, with investments in the software and emerging tech space. He has served as an active voice in the tech community, as well as an active mentor to fellow entrepreneurs on how to build disruptive businesses through accelerators and corporate programs.

Read More

‘Data teams are critical in defining and driving business growth metrics’ says Gaurav Rewari, CEO of Mode.

Media 7 | March 4, 2022

Gaurav Rewari, CEO of Mode elaborates on his role as a CEO of Mode Analytics, the most comprehensive platform for collaborative Business Intelligence and Interactive Data Science. Read on to know more about his thoughts on digitization and Mode's brand-new visualization tool, Visual Explorer.

Read More

'Raising the voices of those who may not always be heard is critical,' says Claire Thomas

Media 7 | April 28, 2023

Claire Thomas is responsible for developing and implementing a strategy for diversity, equity, and inclusion (DEI) across Hitachi Vantara through programs that reflect the diverse backgrounds, interests, and passions of their current and future workforce. Continue reading to learn her views on the significance of inclusion and diversity in an organization.

Read More

Related News

Data Visualization

Oracle Enhances Database 23c with AI Vector Search Capabilities

Oracle | September 20, 2023

Oracle introduces AI Vector Search, enabling semantic search and fast similarity queries by storing semantic content as vectors. Oracle Database 23c, "App Simple," streamlines interactions by declaring outcomes, incorporating AI Vector Search, and offering natural language interfaces. RAG combines large language models (LLMs) with private business data for precise responses to natural language queries while maintaining data privacy. Oracle has announced a significant enhancement to its Oracle Database 23c, introducing semantic search capabilities powered by AI vectors. This innovative collection of features, dubbed AI Vector Search, encompasses a suite of functionalities, including a novel vector data type, vector indexes, and SQL operators. This empowers Oracle Database to store semantic content from various sources, such as documents and images, as vectors and use them to run fast similarity queries. Notably, these advancements also facilitate Retrieval Augmented Generation (RAG), a groundbreaking generative AI technique. RAG combines large language models (LLMs) with private business data to deliver precise responses to natural language queries. Importantly, this approach maintains data privacy by excluding sensitive information from LLM training data. Furthermore, Oracle will enable applications built on Oracle Database and Autonomous Database to add an LLM-based natural language interface. Thus allowing end-users to gain a simplified and intuitive way to request the data they need by framing natural language questions. Additionally, Oracle Database tools such as APEX and SQL Developer will receive enhancements with generative AI capabilities, empowering developers to use natural language for creating applications and SQL queries with ease, eliminating the need for manual coding. Oracle Database 23c, codenamed "App Simple," simplifies the way data professionals, developers, and data users interact with data by stating their desired outcomes rather than hand coding. Data systems will generate solutions using new database technologies such as JSON Relational Duality Views and AI Vector Search with new natural language interface capabilities. Additionally, by merging these technologies with Oracle's low-code APEX development framework, developers will be able to create complete apps. This method represents the future of data and application development and will offer huge productivity increases. Juan Loaiza, Executive Vice President of Mission-Critical Database Technologies, Oracle, stated: Oracle Database is the leading repository of business data, and the combination of business data and semantic data is what enterprises need to implement artificial intelligence solutions, [Source – Cision PR Newswire] Searches on a combination of business and semantic data became easier, faster, and more precise when a single database managed both types of data, stated Loaiza. He further explained that by adding AI Vector Search to Oracle Database, Oracle enables customers to quickly and easily access the benefits of artificial intelligence without compromising security, data integrity, or performance. He emphasized that using Oracle AI Vector Search does not require machine learning expertise and that all database users, including developers and administrators, could learn to use it in less than 30 minutes. The latest updates to Oracle Database services and products include: Modern Oracle Database and AI Application Development Oracle Autonomous Database GoldenGate 23c Free Oracle Autonomous Database Free Container Image Oracle APEX Next-generation Oracle Database Product and Services Oracle Database 23c Oracle Globally Distributed Autonomous Database Oracle Exadata Exascale Autonomous Database Elastic Resource Pools Trusted Data Fabric for AI GoldenGate 23c Oracle GoldenGate Veridata 23c (Beta) OCI GoldenGate Oracle Database Appliance X10 Oracle Database infrastructure for small and medium businesses

Read More

Big Data Management

Reltio Set to Transform Data Management with AI-Powered MDM Solution

Reltio | September 20, 2023

Reltio, a leading modern data management solutions provider, has introduced an innovative AI-powered Master Data Management (MDM) solution. For over a decade, Reltio has been a pioneer in data management, offering a leading cloud-native SaaS MDM solution. The firm is poised to preview novel AI-driven capabilities tailored to meet the growing demand for delivering customers more insightful and timely intelligence powered by reliable data. Participants at the upcoming DataDriven23 conference, taking place from October 3 to 5 in Dallas, TX, will have the opportunity to directly engage with Reltio's innovations and witness how AI is poised to revolutionize MDM, enhancing the quality of data that shapes business outcomes. During the event, experts will showcase three pivotal AI-driven solution capabilities designed to drive remarkable 10X enhancements: Generative AI-driven Search and Data Visualizations: Powered by Reltio's AI-powered MDM platform, enterprises can swiftly pinpoint critical data segments and data visualizations to the Reltio UI, facilitated by a conversational interface powered by GenAI. This simplifies the process of data analysis within Reltio. Increased Data Steward Productivity: Harnessing the power of Machine Learning, Reltio's latest capabilities aim to further automate entity resolution and data quality management processes and enhance data precision while boosting data steward efficiency by 10X. Automated Predictions: Reltio's new AI-driven MDM platform empowers businesses to effortlessly anticipate key insights using propensity models. This aids in identifying crucial customer segments, thereby fostering higher customer acquisition and retention rates. Reltio remains committed to continuous investment in machine learning-based automation within Reltio's Connected Data Platform 2023.3. This includes the introduction of new features in entity resolution and data quality anomaly detection designed to expedite MDM implementations, fortify data unification, and elevate the productivity of data teams. About Reltio Reltio, a renowned leader in modern data management, believes steadfastly in the critical role of data in driving business achievements. The firm is well-known for its cutting-edge data management platform, which effortlessly integrates vital data from different sources into a unified repository of reliable information. This combination of pristine, interconnected, and actionable data empowers Reltio's esteemed clientele to enhance operational efficiency, mitigate risk, and foster sustainable growth. The solution has received praise and is utilized by prominent corporate brands across a wide range of sectors in over 140 countries.

Read More

Big Data Management

Microsoft's AI Data Exposure Highlights Challenges in AI Integration

Microsoft | September 22, 2023

AI models rely heavily on vast data volumes for their functionality, thus increasing risks associated with mishandling data in AI projects. Microsoft's AI research team accidentally exposed 38 terabytes of private data on GitHub. Many companies feel compelled to adopt generative AI but lack the expertise to do so effectively. Artificial intelligence (AI) models are renowned for their enormous appetite for data, making them among the most data-intensive computing platforms in existence. While AI holds the potential to revolutionize the world, it is utterly dependent on the availability and ingestion of vast volumes of data. An alarming incident involving Microsoft's AI research team recently highlighted the immense data exposure risks inherent in this technology. The team inadvertently exposed a staggering 38 terabytes of private data when publishing open-source AI training data on the cloud-based code hosting platform GitHub. This exposed data included a complete backup of two Microsoft employees' workstations, containing highly sensitive personal information such as private keys, passwords to internal Microsoft services, and over 30,000 messages from 359 Microsoft employees. The exposure was a result of an accidental configuration, which granted "full control" access instead of "read-only" permissions. This oversight meant that potential attackers could not only view the exposed files but also manipulate, overwrite, or delete them. Although a crisis was narrowly averted in this instance, it serves as a glaring example of the new risks organizations face as they integrate AI more extensively into their operations. With staff engineers increasingly handling vast amounts of specialized and sensitive data to train AI models, it is imperative for companies to establish robust governance policies and educational safeguards to mitigate security risks. Training specialized AI models necessitates specialized data. As organizations of all sizes embrace the advantages AI offers in their day-to-day workflows, IT, data, and security teams must grasp the inherent exposure risks associated with each stage of the AI development process. Open data sharing plays a critical role in AI training, with researchers gathering and disseminating extensive amounts of both external and internal data to build the necessary training datasets for their AI models. However, the more data that is shared, the greater the risk if it is not handled correctly, as evidenced by the Microsoft incident. AI, in many ways, challenges an organization's internal corporate policies like no other technology has done before. To harness AI tools effectively and securely, businesses must first establish a robust data infrastructure to avoid the fundamental pitfalls of AI. Securing the future of AI requires a nuanced approach. Despite concerns about AI's potential risks, organizations should be more concerned about the quality of AI software than the technology turning rogue. PYMNTS Intelligence's research indicates that many companies are uncertain about their readiness for generative AI but still feel compelled to adopt it. A substantial 62% of surveyed executives believe their companies lack the expertise to harness the technology effectively, according to 'Understanding the Future of Generative AI,' a collaboration between PYMNTS and AI-ID. The rapid advancement of computing power and cloud storage infrastructure has reshaped the business landscape, setting the stage for data-driven innovations like AI to revolutionize business processes. While tech giants or well-funded startups primarily produce today's AI models, computing power costs are continually decreasing. In a few years, AI models may become so advanced that everyday consumers can run them on personal devices at home, akin to today's cutting-edge platforms. This juncture signifies a tipping point, where the ever-increasing zettabytes of proprietary data produced each year must be addressed promptly. If not, the risks associated with future innovations will scale up in sync with their capabilities.

Read More

Data Visualization

Oracle Enhances Database 23c with AI Vector Search Capabilities

Oracle | September 20, 2023

Oracle introduces AI Vector Search, enabling semantic search and fast similarity queries by storing semantic content as vectors. Oracle Database 23c, "App Simple," streamlines interactions by declaring outcomes, incorporating AI Vector Search, and offering natural language interfaces. RAG combines large language models (LLMs) with private business data for precise responses to natural language queries while maintaining data privacy. Oracle has announced a significant enhancement to its Oracle Database 23c, introducing semantic search capabilities powered by AI vectors. This innovative collection of features, dubbed AI Vector Search, encompasses a suite of functionalities, including a novel vector data type, vector indexes, and SQL operators. This empowers Oracle Database to store semantic content from various sources, such as documents and images, as vectors and use them to run fast similarity queries. Notably, these advancements also facilitate Retrieval Augmented Generation (RAG), a groundbreaking generative AI technique. RAG combines large language models (LLMs) with private business data to deliver precise responses to natural language queries. Importantly, this approach maintains data privacy by excluding sensitive information from LLM training data. Furthermore, Oracle will enable applications built on Oracle Database and Autonomous Database to add an LLM-based natural language interface. Thus allowing end-users to gain a simplified and intuitive way to request the data they need by framing natural language questions. Additionally, Oracle Database tools such as APEX and SQL Developer will receive enhancements with generative AI capabilities, empowering developers to use natural language for creating applications and SQL queries with ease, eliminating the need for manual coding. Oracle Database 23c, codenamed "App Simple," simplifies the way data professionals, developers, and data users interact with data by stating their desired outcomes rather than hand coding. Data systems will generate solutions using new database technologies such as JSON Relational Duality Views and AI Vector Search with new natural language interface capabilities. Additionally, by merging these technologies with Oracle's low-code APEX development framework, developers will be able to create complete apps. This method represents the future of data and application development and will offer huge productivity increases. Juan Loaiza, Executive Vice President of Mission-Critical Database Technologies, Oracle, stated: Oracle Database is the leading repository of business data, and the combination of business data and semantic data is what enterprises need to implement artificial intelligence solutions, [Source – Cision PR Newswire] Searches on a combination of business and semantic data became easier, faster, and more precise when a single database managed both types of data, stated Loaiza. He further explained that by adding AI Vector Search to Oracle Database, Oracle enables customers to quickly and easily access the benefits of artificial intelligence without compromising security, data integrity, or performance. He emphasized that using Oracle AI Vector Search does not require machine learning expertise and that all database users, including developers and administrators, could learn to use it in less than 30 minutes. The latest updates to Oracle Database services and products include: Modern Oracle Database and AI Application Development Oracle Autonomous Database GoldenGate 23c Free Oracle Autonomous Database Free Container Image Oracle APEX Next-generation Oracle Database Product and Services Oracle Database 23c Oracle Globally Distributed Autonomous Database Oracle Exadata Exascale Autonomous Database Elastic Resource Pools Trusted Data Fabric for AI GoldenGate 23c Oracle GoldenGate Veridata 23c (Beta) OCI GoldenGate Oracle Database Appliance X10 Oracle Database infrastructure for small and medium businesses

Read More

Big Data Management

Reltio Set to Transform Data Management with AI-Powered MDM Solution

Reltio | September 20, 2023

Reltio, a leading modern data management solutions provider, has introduced an innovative AI-powered Master Data Management (MDM) solution. For over a decade, Reltio has been a pioneer in data management, offering a leading cloud-native SaaS MDM solution. The firm is poised to preview novel AI-driven capabilities tailored to meet the growing demand for delivering customers more insightful and timely intelligence powered by reliable data. Participants at the upcoming DataDriven23 conference, taking place from October 3 to 5 in Dallas, TX, will have the opportunity to directly engage with Reltio's innovations and witness how AI is poised to revolutionize MDM, enhancing the quality of data that shapes business outcomes. During the event, experts will showcase three pivotal AI-driven solution capabilities designed to drive remarkable 10X enhancements: Generative AI-driven Search and Data Visualizations: Powered by Reltio's AI-powered MDM platform, enterprises can swiftly pinpoint critical data segments and data visualizations to the Reltio UI, facilitated by a conversational interface powered by GenAI. This simplifies the process of data analysis within Reltio. Increased Data Steward Productivity: Harnessing the power of Machine Learning, Reltio's latest capabilities aim to further automate entity resolution and data quality management processes and enhance data precision while boosting data steward efficiency by 10X. Automated Predictions: Reltio's new AI-driven MDM platform empowers businesses to effortlessly anticipate key insights using propensity models. This aids in identifying crucial customer segments, thereby fostering higher customer acquisition and retention rates. Reltio remains committed to continuous investment in machine learning-based automation within Reltio's Connected Data Platform 2023.3. This includes the introduction of new features in entity resolution and data quality anomaly detection designed to expedite MDM implementations, fortify data unification, and elevate the productivity of data teams. About Reltio Reltio, a renowned leader in modern data management, believes steadfastly in the critical role of data in driving business achievements. The firm is well-known for its cutting-edge data management platform, which effortlessly integrates vital data from different sources into a unified repository of reliable information. This combination of pristine, interconnected, and actionable data empowers Reltio's esteemed clientele to enhance operational efficiency, mitigate risk, and foster sustainable growth. The solution has received praise and is utilized by prominent corporate brands across a wide range of sectors in over 140 countries.

Read More

Big Data Management

Microsoft's AI Data Exposure Highlights Challenges in AI Integration

Microsoft | September 22, 2023

AI models rely heavily on vast data volumes for their functionality, thus increasing risks associated with mishandling data in AI projects. Microsoft's AI research team accidentally exposed 38 terabytes of private data on GitHub. Many companies feel compelled to adopt generative AI but lack the expertise to do so effectively. Artificial intelligence (AI) models are renowned for their enormous appetite for data, making them among the most data-intensive computing platforms in existence. While AI holds the potential to revolutionize the world, it is utterly dependent on the availability and ingestion of vast volumes of data. An alarming incident involving Microsoft's AI research team recently highlighted the immense data exposure risks inherent in this technology. The team inadvertently exposed a staggering 38 terabytes of private data when publishing open-source AI training data on the cloud-based code hosting platform GitHub. This exposed data included a complete backup of two Microsoft employees' workstations, containing highly sensitive personal information such as private keys, passwords to internal Microsoft services, and over 30,000 messages from 359 Microsoft employees. The exposure was a result of an accidental configuration, which granted "full control" access instead of "read-only" permissions. This oversight meant that potential attackers could not only view the exposed files but also manipulate, overwrite, or delete them. Although a crisis was narrowly averted in this instance, it serves as a glaring example of the new risks organizations face as they integrate AI more extensively into their operations. With staff engineers increasingly handling vast amounts of specialized and sensitive data to train AI models, it is imperative for companies to establish robust governance policies and educational safeguards to mitigate security risks. Training specialized AI models necessitates specialized data. As organizations of all sizes embrace the advantages AI offers in their day-to-day workflows, IT, data, and security teams must grasp the inherent exposure risks associated with each stage of the AI development process. Open data sharing plays a critical role in AI training, with researchers gathering and disseminating extensive amounts of both external and internal data to build the necessary training datasets for their AI models. However, the more data that is shared, the greater the risk if it is not handled correctly, as evidenced by the Microsoft incident. AI, in many ways, challenges an organization's internal corporate policies like no other technology has done before. To harness AI tools effectively and securely, businesses must first establish a robust data infrastructure to avoid the fundamental pitfalls of AI. Securing the future of AI requires a nuanced approach. Despite concerns about AI's potential risks, organizations should be more concerned about the quality of AI software than the technology turning rogue. PYMNTS Intelligence's research indicates that many companies are uncertain about their readiness for generative AI but still feel compelled to adopt it. A substantial 62% of surveyed executives believe their companies lack the expertise to harness the technology effectively, according to 'Understanding the Future of Generative AI,' a collaboration between PYMNTS and AI-ID. The rapid advancement of computing power and cloud storage infrastructure has reshaped the business landscape, setting the stage for data-driven innovations like AI to revolutionize business processes. While tech giants or well-funded startups primarily produce today's AI models, computing power costs are continually decreasing. In a few years, AI models may become so advanced that everyday consumers can run them on personal devices at home, akin to today's cutting-edge platforms. This juncture signifies a tipping point, where the ever-increasing zettabytes of proprietary data produced each year must be addressed promptly. If not, the risks associated with future innovations will scale up in sync with their capabilities.

Read More

Spotlight

Sama

Sama

Sama provides high quality training data that powers AI technology. The company’s platform is trusted by the world’s most ambitious organizations to develop accurate machine learning models. Used by leading technology companies such as Google, NVIDIA, GM, and Walmart, Sama specializes in image, vide...

Events

Resources

resource image

Business Intelligence, Big Data Management, Big Data

Understanding User Entity and Behavior Analytics (UEBA)

Video

resource image

Business Intelligence, Big Data Management, Data Science

Tackling climate change with data science and AI

Whitepaper

resource image

Business Intelligence, Big Data Management, Data Science

Data Beyond Borders 3.0

Whitepaper

resource image

Business Intelligence, Big Data Management, Big Data

Understanding User Entity and Behavior Analytics (UEBA)

Video

resource image

Business Intelligence, Big Data Management, Data Science

Tackling climate change with data science and AI

Whitepaper

resource image

Business Intelligence, Big Data Management, Data Science

Data Beyond Borders 3.0

Whitepaper

Events