Data quality essential in training ChatGPT

Issue 7 2023 AI & Data Analytics

It is a year since OpenAI launched ChatGPT to the public, with adoption rates skyrocketing at an unprecedented pace. By February 2023, Reuters reported an estimated 100 million active users. Fast forward to September, and the ChatGPT website has attracted nearly 1,5 billion visitors, showcasing the platform’s immense popularity and integral role in today’s digital landscape.

Willem Conradie, CTO of PBT Group, reflects on this journey, noting the significant usage and adoption of ChatGPT across various sectors. “The rise of ChatGPT has highlighted significant concerns. These range from biased outputs, question misinterpretation, inconsistent answers, and lack of empathy to security issues. To navigate these, the concept of Responsible AI has gained momentum, emphasising the importance of applying AI with fair, inclusive, secure, transparent, accountable, and ethical intent. Adopting such an approach is vital, especially when dealing with fabricated information, where ChatGPT provides incorrect or outdated information,” says Conradie.

Of course, the platform’s versatility extends beyond public use. It serves as a powerful tool in corporate environments, enhancing various business processes such as customer service enquiries, email drafting, personal assistant tasks, keyword searches, and creating presentations. For the best performance, it is essential that ChatGPT provides accurate responses. This necessitates training on data that is relevant to the company, accurate, and timely.

“Consider a scenario where ChatGPT is employed to automatically service customer enquiries, with the aim of enhancing customer experience by delivering personalised responses. If the underlying data quality is compromised, ChatGPT may provide inaccurate responses, ranging from minor errors like incorrect customer names to major issues like providing incorrect self-help instructions on the company’s mobile app. Such inaccuracies could lead to customer frustration, ultimately damaging the customer experience and negating the intended positive outcomes.”

Addressing such data quality concerns is paramount. Ensuring relevance is the first step. This requires the data used for model training to align with the business context in which ChatGPT operates. Timeliness is another critical factor, as outdated data could lead to inaccurate responses. The data must also be complete. Ensuring the dataset is free from missing values, duplicates, or irrelevant entries is important, as these could also result in incorrect responses and actions.
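
The completeness, duplicate, and timeliness checks described above can be illustrated with a minimal Python sketch. It assumes training records are plain dictionaries. The function name, field names, sample records, and the 365-day freshness threshold are illustrative assumptions and are not taken from PBT Group or OpenAI.

```python
from datetime import date


def check_quality(records, required_fields, as_of, max_age_days):
    """Return the indices of records that fail each data quality check."""
    issues = {"incomplete": [], "duplicate": [], "stale": []}
    seen = set()
    for i, rec in enumerate(records):
        # Completeness: every required field must be present and non-empty.
        if any(rec.get(f) in (None, "") for f in required_fields):
            issues["incomplete"].append(i)
        # Duplicates: the required fields match those of an earlier record.
        key = tuple(rec.get(f) for f in required_fields)
        if key in seen:
            issues["duplicate"].append(i)
        seen.add(key)
        # Timeliness: the record has no update date or is older than allowed.
        updated = rec.get("updated")
        if updated is None or (as_of - updated).days > max_age_days:
            issues["stale"].append(i)
    return issues


# Hypothetical customer-service records, invented for this example.
records = [
    {"customer": "A. Naidoo", "answer": "Reset your PIN under Settings.",
     "updated": date(2023, 10, 1)},
    {"customer": "A. Naidoo", "answer": "Reset your PIN under Settings.",
     "updated": date(2023, 10, 1)},   # exact duplicate of the first record
    {"customer": "", "answer": "Tap Help.",
     "updated": date(2023, 9, 15)},   # missing customer name
    {"customer": "B. Smith", "answer": "Use the old menu.",
     "updated": date(2021, 1, 5)},    # outdated instruction
]

report = check_quality(
    records,
    required_fields=("customer", "answer"),
    as_of=date(2023, 11, 1),
    max_age_days=365,
)
print(report)
```

Records flagged by a check like this would be corrected or excluded before the data is used for training. Relevance to the business context is harder to automate and typically still needs review by someone who knows the domain.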

Moreover, continuously improving the model through reinforcement learning, which incorporates user feedback into model retraining cycles, is essential. This helps ChatGPT, and conversational AI models in general, to learn from their interactions, adapt, and enhance their response quality over time.

“The data quality management practices highlighted here, while not exhaustive, serve as a practical starting point. They are applicable not just to ChatGPT, but to conversational AI and other AI applications like generative AI. All this reinforces the importance of data quality across the spectrum of AI technologies,” concludes Conradie.






