Hadoop Happenings: Announcements and New Releases

April 12, 2016

Grab the latest news and commentary on Hadoop in this week’s Hadoop Happenings. This week LinkedIn released another Hadoop tool, Hortonworks made several announcements, and MarketShare shared what it has learned since its Hadoop deployment. See the full stories below.

Spotlight

Quora

Quora is a Q&A platform that empowers people to share and grow the world’s knowledge. The vast majority of human knowledge is still not on the internet. Most of it is trapped in the form of experience in people’s heads, or buried in books and papers that only experts can access. More than a billion people use the internet yet only a tiny fraction contribute their knowledge to it. We want to democratize access to knowledge of all kinds—from politics to painting, cooking to coding, etymology to experiences—so if someone out there knows something, anyone else can learn it. Our mission is to share and grow the world's knowledge, and we're building a world-class team to help us achieve this mission.

OTHER ARTICLES

Why Data Science Needs DataOps

Article | March 31, 2020

DataOps helps reduce the time data scientists spend preparing data for use in applications. Such tasks consume roughly 80% of their time now. We’re still hopeful that the digital transformation will provide the insights businesses need from big data. As a data scientist, you’re probably aware of the growing pressure from companies to extract meaningful insights from data and find the stories needed for impact. No matter how in-demand data science is in the employment numbers, equal pressure is rising for data scientists to deliver business value, and no wonder. We’re approaching the age where data science and AI draw a line in the sand for which companies remain competitive and which ones collapse. One answer to this pressure is the rise of DataOps. Let’s take a look at what it is and how it could provide a path for data scientists to give businesses what they’ve been after.


Saurav Singla, the machine learning guru, empowering society

Article | December 10, 2020

Saurav Singla is a Senior Data Scientist, a Machine Learning Expert, an Author, a Technical Writer, a Data Science Course Creator and Instructor, a Mentor, and a Speaker. While Media 7 has followed Saurav Singla’s story closely, this chat with Saurav was about analytics, his journey as a data scientist, and what he brings to the table with his 15 years of extensive experience in statistical modeling, machine learning, natural language processing, deep learning, and data analytics across the Consumer Durable, Retail, Finance, Energy, Human Resource and Healthcare sectors. He has grown multiple businesses in the past and is still a researcher at heart. In the past, analytics and predictive modeling were predominant in only a few industries, but they are now becoming a prominent part of emerging fields such as health, human resource management, pharma, IoT, and other smart solutions as well. Saurav has worked in data science since 2003. Over the years, he realized that all the people they had hired, whether from business or engineering backgrounds, needed extensive training to be able to perform analytics on real-world business datasets. He got an opportunity to move to Australia in 2003. He joined the retail company Harvey Norman in Australia, working out of their Melbourne office for four years. After moving back to India in 2008, he joined one of the verticals of Siemens, one of the few companies in India then using analytics services in-house, and stayed for eight years. He is a passionate believer that the use of data and analytics will dramatically change not only corporations but also our societies. Building and expanding the application of analytics for supply chain, logistics, sales, marketing, and finance at Siemens was a tremendously rewarding and enjoyable experience for him. He grew the team from zero to fifteen while he was the data science leader.
He believes those eight years taught him how to think big and scale organizations using data science. He has demonstrated success in developing and seamlessly executing plans in complex organizational structures. He has also been recognized for maximizing performance by implementing appropriate project management tools, analyzing details to ensure quality control, and understanding emerging technology. In 2016, he felt a serious inner push to join a consulting firm and shifted to a company based in Delhi NCR. During his ten months with them, he improved the way clients and businesses implement and exploit machine learning in their consumer commitments. As part of that vision, he developed class-defining applications that eliminate tension between technologies, processes, and humans. Another main aspect of his plan was to ensure that it was effected in very fast agile cycles. Towards that, he was actively innovating on operating and engagement models. In 2017, he moved to London and joined a digital technology company, where he assisted in building artificial intelligence and machine learning products for their clients. He aimed to solve problems and transform costs using technology and machine learning. He was associated with them for 2 years. At the beginning of 2018, he joined Mindrops. He developed advanced machine learning technologies and processes to solve client problems. He mentored the Data Science function and guided them in the development of solutions. He built robust client Data Science capabilities that can scale across multiple business use cases. Outside work, Saurav is associated with Mentoring Club and Revive. He volunteers in his spare time to help, coach, and mentor young people taking up careers in the data science domain, and to help data practitioners build high-performing teams and grow the industry.
He assists data science enthusiasts in staying motivated and guides them along their career path. He helps fill the knowledge gap and helps aspirants understand the core of the industry. He helps aspirants analyze their progress and upskill accordingly. He also helps them connect with potential job opportunities through their industry-leading network. Additionally, in 2018, he joined as a mentor at a Transaction Behavioral Intelligence company that accelerates business growth for banks with the use of Artificial Intelligence and Machine Learning enabled products. He is guiding their machine learning engineers with their projects. He is enhancing the capabilities of their AI-driven recommendation engine product. Saurav teaches learners to grasp data science knowledge in a more engaging way by providing courses on the Udemy marketplace. He has created two courses on Udemy, with over twenty thousand students enrolled in them. He regularly speaks at meetups on data science topics and writes articles in major publications such as AI Time Journal, Towards Data Science, Data Science Central, KDnuggets, Data-Driven Investor, HackerNoon, and Infotech Report. He actively contributes academic research papers in machine learning, deep learning, natural language processing, statistics and artificial intelligence. His book on Machine Learning for Finance was published by BPB Publications, which is Asia's largest publisher of Computer and IT Books. This is possibly one of the biggest milestones of his career. Saurav turned his passion into making knowledge available to society. Saurav believes sharing knowledge is cool, and he wishes everyone had that passion for knowledge sharing. That would be his success.


Deep Dive Digital-First Banks Harness The Power Of Data Analytics

Article | April 2, 2020

Data analytics has many purposes in the banking industry, ranging from improving cybersecurity to reducing customer churn. Every interaction, from ATM withdrawals to loan applications, provides FIs with valuable data about customers’ financial lifestyles. Banks can even harness external regulatory, trading and social media engagement data, all of which can be processed and analyzed to benefit their operations. Financial data is useful in helping banks develop wide-reaching marketing campaigns, but social data is critical to developing offers for specific customers. Santa Rosa, California-based Redwood Credit Union, for example, found that social data was particularly important when offering auto loans. It initially extended preapproval for such loans every two years based solely on members’ credit scores and vehicle purchase histories, but it soon discovered that there was a much more reliable indicator and updated its preapproval frequency accordingly.


THE NOT-SO-DISTANT FUTURE OF WORK

Article | November 20, 2020

As smart machines, data, and algorithms usher in dramatic technological transformation, reactions to their global impact span from cautious optimism to doomsday scenarios. Widespread transformation, displacement, and disaggregation of labor markets are predicted in countries like India, with an estimated workforce of 600 million by 2022, as well as in the global labor market. Even today, we are witnessing the resurgence of 'hybrid' jobs where distinctive human abilities are paired with data and algorithms, and 'super' jobs that involve deep tech. Our historical response to such tectonic shifts and upheavals has been predictable so far: trepidation and uncertainty in the beginning, followed by a period of painful transition. Communities and nations that can sense and respond will be able to shape social, economic, and political order decisively. However, with general AI predicted to come of age by 2050-60, governments will need to frame effective policies to meet their obligations to their citizens. This involves the creation of a new social contract between the individual, enterprise, and state for an inclusive and equitable society.

The present age is marked by automation, augmentation, and amplification of human talent by transformative technologies. A typical career may go through 15-20 transitions. And given the gig economy, the shelf-life of skills is rapidly shrinking. Many agree that over the next 30 years, the nature and the volume of jobs will be significantly redefined. So even though it is nearly impossible to gaze into the crystal ball 100 years ahead, one can take a shot at what jobs may emerge in the next 20-30 years given the present state. Here is a glimpse into the kind of technological changes the next generation might witness that will change the employment scenario:

RESTORATION OF BIODIVERSITY

Our biodiversity is shrinking frighteningly fast, for both flora and fauna.
Extinct species revivalists may be challenged with restoring and reintegrating pertinent elements back into the natural environment. Without biodiversity, humanity will perish.

PERSONALIZED HEALTHCARE

Medicine is rapidly getting personalized as genome sequencing becomes commonplace. Even today, Elon Musk's Neuralink is working on brain-machine interfaces. So you may soon be able to upload your brain onto a computer where it can be edited, transformed, and re-uploaded back into you. Anti-aging practitioners will be tasked with extending human life-spans to ensure we stay productive late into our twilight years. Gene sequencers will help personalize treatments, and epigenetic therapists will manipulate gene expression to overcome disease and decay. Brain neurostimulation experts and augmentationists may be commonplace to ensure we are happier, healthier, and disease-free. In fact, happiness itself may get redefined as it shifts from the quality of our relationships to the quality of man-machine integration.

THE QUANTIFIED SELF

As more of the populace interacts and engages with a digitized world, digital rehabilitators will help you detox and regain your sense of self, which may get inseparably intertwined with smart machines and interfaces.

DATA-LED VALUE CREATION

Data is exploding at a torrid pace and becoming a source of value-creation. While today's organizations are scrambling to create data lakes, future data-centers will be entrusted with sourcing high-value data, securing rights to it, and even licensing it to others. Data will increasingly create competitive asymmetries amongst organizations and nations. Data brokers will be the new intermediaries, and data detectives, analysts, monitors or watchers, auditors, and frackers will emerge as new-age roles. Since data and privacy issues are entwined, data regulators, ethicists, and trust professionals will thrive. Many new cyber laws will come into existence.
HEALING THE PLANET

As the world grapples with the specter of climate change, our focus on sustainability and clean energy will intensify. Our landfills are choked with both toxic and non-toxic waste. Plastic alone takes almost 1000 years to degrade, so landfill operators will use earthworm-like robots to help decompose waste and recoup precious recyclable waste. Nuclear fusion will emerge as the new source of clean energy, creating a broad gamut of engineers, designers, integrators, architects, and planners around it. We may even generate power in space. Since our oceans are infested with waste, a lot of initiatives and roles will emerge around cleaning the marine environment to ensure natural habitat and food security.

TAMING THE GENOME

As technologies like CRISPR and Prime-editing mature, we may see a resurgence of biohackers and programmable healthcare. Our health and nutrition may be algorithmically managed. CRISPR-like advancements will need a swathe of engineers, technicians, auditors, and regulators for genetically engineered health that may overcome a wide variety of diseases and extend life-expectancy.

THE RISE OF BOTS

Humanoid and non-humanoid robots will need entire workforce ecosystems around them, spanning suppliers, programmers, operators, and maintenance experts through to ethicists and UI-designers. Smart robot psychologists will have to counsel them and ensure they are safe and friendly. Regulators may grant varying levels of autonomy to robots.

DATA LOADS THE GUN, CREATIVITY FIRES THE TRIGGER

Today's deep-learning Generative Adversarial Networks (GANs) can create music like Mozart and paintings like Picasso. Such advancements will give birth to a wide array of AI-enhanced professionals, like musicians, painters, authors, quantum programmers, cybersecurity experts, educators, etc.

FROM AUGMENTATION TO AUTONOMY

Autonomous driving is about to mature in the next few years and will extend to air and space travel.
Safety will exceed human capabilities, and we may soon reach a state of diminishing returns where we will employ fewer humans to prevent mishaps and unforeseen occurrences. This industry will need supportive command center managers, traffic analyzers, fleet managers, and people to look after the onboarding experience.

BLOCKCHAIN BECOMES PERVASIVE

Blockchain will create a lot of jobs for its mainstream and derivative applications. Even though most of its present applications are in the Financial Services, Supply Chain, and Asset Management industries, very soon its adoption and integration will be a lot more expansive. Engineers, designers, UI/UX experts, analysts, auditors, and regulators will be required to manage blockchain-related applications. With Crypto being one of its better-known applications, a lot of transaction specialists, miners, insurers, wealth managers, and regulators will be needed. Crypto exchanges will come under the purview of the regulatory framework.

3D PRINTING TURNS GAME-CHANGER

Additive manufacturing, also popularly called 3D printing, will mature in its precision, capabilities, and market potential. Lab-grown, 3D-printed food will be part of our regular diet. Transplantable organs will be generated using stem cell research and 3D printing. Amputees and the disabled will adopt 3D-printed limbs and prosthetics. Its applications for high-precision reconstructive surgery are already commonplace. Pills are being 3D printed as we speak. So again, we are looking at 3D printers, operators, material scientists, pharmacists, construction experts, etc.

THE COLONIZATION OF OUTER SPACE

Jeff Bezos's Blue Origin and Elon Musk's SpaceX signal a new horizon. As space tech gets into a new trajectory, a new breed of commercial space pilots, mission planners, launch managers, cargo experts, ground crew, experience designers, etc. will be required.
Since we have ravaged the limited resources of our planet already, mankind will need to venture into asteroid mining for rare and precious metals. This will need scouts and surveyors, meteorologists, remote bot operators, remotely managed factories, and whatnot.

THE HYPER-CONNECTED WORLD

As of 2020, we already have between 50 and 75 billion connected devices. By 2040, this will likely swell to more than 100 trillion sensors that will spew out a dizzying volume of real-time data ready for analytics and AI. A complete IoT system as we know it is aware, autonomous, and actionable, just like a self-driving car. Imagine the number of data modelers, sensor designers and installers, and signal architects and engineers that will be needed. Home automation will be pervasive, and smart medicines, implants, and wearables will be the norm of the day.

DRONES USHER IN DISRUPTION

Unmanned aerial and underwater drones are already becoming ubiquitous for applications in aerial surveillance, delivery, and security. Countries are awakening to their potential as well as the possibilities of misuse. Command centers, just like those for space travel, will manage them as countries rush to put a regulatory framework around them. An army of designers, programmers, security experts, and traffic flow optimizers will harness their true potential.

SHIELDING YOUR DATA

With data come cyber threats, data breaches, cyber warfare, cyber espionage, and a host of other issues. The more data-dependent and connected the world is, the bigger the problem of cybersecurity will be. The severity of the problem will increase manifold from current issues like phishing, spyware, malware, viruses and worms, ransomware, DoS/DDoS attacks, and hacktivism, and cybersecurity will indeed be big business. The problem is that threats are increasing 10X faster than investments in this space, and the interesting thing is that it is a lot more about audits, governance, policies, and compliance than technology alone.
FOOD-TECH COMES OF AGE

As the world population grows to 9.7 billion people in 2050, cultured food and lab-grown meat will hit our tables to ensure food security. Entire food chains and value delivery networks will see unprecedented change. Agriculture will be transformed with robotics, IoT, and drones, and the food-tech sector will take off in a big way.

QUANTUM COMPUTING SOLVES INTRACTABLE PROBLEMS

Finally, while the list is very long, let’s touch upon the advent of qubits, or quantum computing. With its ability to break the best encryption on the planet, the traditional asymmetric encryption, public key infrastructure, digital envelopes, and digital certificates in use today will be rendered useless. Bring in the quantum programmers, analysts, privacy and trust managers, health monitors, etc.

As we brace for the world that looms large ahead of us, the biggest enabler, which will itself be transformed, will be Education 4.0. Education will cease to be a phase in your life. Life-long interventions will be needed to adapt, impart, and shape the skills of individuals so they are ready for the future of work. More power to the people!

