Even after anti-racism training AI chatbots like ChatGPT still exhibit racial prejudice

Researchers say that LLM makers like OpenAI need to more thoroughly vet their AIs for "covert racism". (Image: OpenAI)
AI chatbots like ChatGPT can still produce racially prejudiced responses even after safety training, researchers have found. The study highlights the need for greater care and vetting for “covert prejudice” before LLMs are made publicly available.

Researchers testing AI chatbots built on large language models such as OpenAI’s GPT-4 have discovered that they can still exhibit racial prejudice, even after undergoing anti-racism training. The finding follows Google’s recent Gemini AI controversy, in which its new LLM over-corrected for racism and generated what some called “woke” reinterpretations of history, depicting African American men, for example, as Nazi soldiers from World War II. Getting the balance right on race, it seems, is proving difficult for creators of LLMs.

In the latest study, highlighted by New Scientist, researchers discovered that dozens of different LLMs they tested still showed racial bias when presented with text written in African American dialects, despite the models, including OpenAI’s GPT-4 and GPT-3.5, having been specifically trained to avoid racial bias in their responses. In one instance, GPT-4 proved more inclined to recommend a death sentence for a hypothetical defendant when that defendant spoke English with an African American dialect.

The same “covert prejudice” was also apparent in job recommendations: compared with input written in standard American English, the models matched African Americans to careers less likely to require a degree, or went as far as associating people of African American heritage with having no job at all. The researchers also found that the larger the language model, the greater the likelihood of it exhibiting these underlying biases. The study raises concerns about using generative AI for screening purposes, including reviewing job applications.

The researchers concluded that their study raises questions about the effectiveness of human-based AI safety training interventions, which appear to remove racism and bias only at a surface level while struggling to root it out at a deeper level, where users’ inputs contain no explicit racial identity terms. They recommend that companies developing LLMs vet their chatbots thoroughly before releasing them to the public.

Source(s)

New Scientist [sub. req.]

Sanjiv Sathiah, 2024-03-11 (Update: 2024-03-11)