There aren’t many industries that are going to completely change over the next three years, but there are a few that are already making major shifts in how they operate. Machine Learning holds significant promise for many of these high-growth industries, but it’s going to require significant changes in how we think about Machine Learning and data.
Machine Learning is at the heart of nearly every industry in the world, from banking and insurance to retail and healthcare. With the recent rise in popularity of Artificial Intelligence and Machine Learning, it’s no wonder that more and more companies are investing in the technology.
Everything you have read until this point has been written by Helm’s Machine Learning model – a bot – given only a headline and a single sentence to work with. This is similar to what the popular ChatGPT model has been doing for numerous users since the end of November 2022. Will we start to see more of this in the next year?
Globally, it is predicted that in 2023, artificial intelligence (AI) will become more prevalent, along with natural language processing (NLP) and machine learning (ML) advancement – areas Helm has been innovating in for several years already. Along with computer vision, speech-to-text, and all marginal aspects of automation, we will see a massive impact on how technology is going to shape the future.
Helm’s head of engineering, Dr Colin Schwegmann, and natural language processing specialist, Ari Ramkilowan – a member of the team who recently won Wikimedia Foundation's Research Award of the Year – weigh in on the three key trends they’re expecting to see and unpack how they will be adapted for the average person.
1. Unsupervised/self-supervised learning
Schwegmann explains there is many data played with in different spaces, e.g. images for image-based clients, and text for text-based clients. One of the most difficult parts is labelling that information, which entails adding specific human-verified names to images or text. “There is so much data that it cannot always be labelled. So we are starting to see an increase in self-supervised or unsupervised learning – using the data, we have to learn the shapes and types of info we have, instead of labelling it all – by finding patterns inside the data. We have recently successfully done this with Dr Oetker, where we’ve used high-resolution cameras and computer vision to create an AI-based application capable of identifying pizzas on the production line that do not meet the brand’s strict quality control standards.”
Self-supervised learning occurs when you provide unstructured data to a ML algorithm and ask it to learn patterns that become useful. A classic example of self-supervised learning for text is a language model. Language models, such as GPT3, are fed masses of text data and then trained to predict the next word, based on the previous sentence. This is exactly what the trained model, used in the first two paragraphs, did. This also helps the AI to contextualise the given data and reapply it for future tasks.
Self-supervised learning in imagery, however, is subtly different from text, and is at the forefront of ML.
Ramkilowan says, “Step away from the paradigm of taking a model and showing it hundreds and hundreds of examples to distinguish between cats and dogs or overtopped and undertopped pizza, or, in text, is our user happy or unhappy at this point? In the past, we would require hundreds or even thousands of examples to train a model. With prompt-based ‘Zero-Shot Learning’, you can give your model a brief description of your task and it will give you the correct response, without explicitly being trained to do so. We’re moving from a data-centric world (where mountains of data are needed by people with coding knowledge) to a people-centric one, where more people without any prior knowledge or training are able to leverage these technologies.”
“That’s the whole self-supervised approach to ML,” says Schwegmann, “it is building our systems on the backs of giants.”
2. Natural language interfaces with technology
“Then moving from self-supervised learning, we will see an increase in both voice-only and multimodal-based applications. I predict more natural language interfaces to be used with technology. For example, today Siri works mainly on an iPhone, sometimes on a laptop. In the future, these kinds of natural language interfaces will be far more pervasive, available on more devices for more complicated tasks – think photo and video editing, but using your voice to instruct the machine,” explains Ramkilowan.
The ability for models to understand users’ intentions and execute tasks will improve dramatically over the next 2 - 5 years. “We are going to move – not away from the graphical user interface – but we are going to have a lot of natural language interfaces with our technology. Therefore, communicating with self-driving cars is going to be a realistic future. The natural language interfaces will be a primary mode of communicating with these devices, and there is currently a focus on how to bridge this future with our own bots and other tools as we are still in the early stages.”
3. Decision intelligence
“We have lots of data that we use to train our systems, but on the flip side, we have lots of usage data that is not used. What could happen is that we see an increased efficiency in AI's ability to discover hidden patterns in unstructured data, allowing us to unlock actionable insights and develop better guidelines and best practices at an inter and intra-organisational level."
For any of these trends to transpire, there are limitations that will require adjustments and improvements, but once made, we will see these predictions unfold. For example, a trained text-based model does not evolve over time and therefore will not understand the changes that come about post-training, e.g. the difference between the pre-COVID world and today.
What effect will this have on the average person?
One of the applications, Github Co-Pilot – which, in one of its uses, can be thought of as an auto-complete engine for software development – is built on OpenAI Codex, a model trained on billions of lines of code. Github Co-Pilot is capable of generating usable code to assist software developers to be more efficient and productive.
“Things like Github Co-Pilot and GPT3 are just scratching the surface of what is possible. In the next wave of model breakthroughs, we will likely see domain-specific proliferation in both creative and scientific fields (like Github Co-Pilot for video editing or biomedicine). We will be able to see these assisting creatives and helping scientists iterate and discover new things in equal measure,” explains Ramkilowan.
“Having the AI will help many fields become more efficient and more productive and accelerate progress in general at a rapid rate and we have seen that now with Github Co-Pilot.”
Five years ago, or even less, we would have said AI would be good at replicating menial tasks, or even those that require some skill, but what the industry has seen, turns this idea on its head, with the debate of even artistic areas veins affected. E.g. Tiktok filters. It is not about replacing jobs, it is about making work or products better, and improving efficiencies, sparking new ideas, and scaling. It is effectively putting the tech of ML into the hands of the average person, allowing them to pursue other interests with the time they would otherwise have spent on tasks AI now does.
Schwegmann concludes, “This way, people have more of an influence on how AI or ML is used going forward.”
For more information, contact Helm at www.helm.africa
.
© Technews Publishing (Pty) Ltd. | All Rights Reserved.