Researchers in China developed a hallucination correction engine for AI models https://ift.tt/KkMYZXE

Technology October 25, 2023

The “Woodpecker” hallucination correction system can, ostensibly, be applied to any multimodal large language model, according to the research.

A team of scientists from the University of Science and Technology of China and Tencent’s YouTu Lab have developed a tool to combat “hallucination” by artificial intelligence (AI) models.

Hallucination is the tendency for an AI model to generate outputs with a high level of confidence that don’t appear based on information present in its training data. This problem permeates large language model (LLM) research, and its effects can be seen in models such as OpenAI’s ChatGPT and Anthropic’s Claude.

The USTC/Tencent team developed a tool called “Woodpecker” that they claim is capable of correcting hallucinations in multimodal large language models (MLLMs).

This subset of AI involves models such as GPT-4 (especially its visual variant, GPT-4V) and other systems that roll vision and/or other processing into the generative AI modality alongside text-based language modeling.

According to the team’s preprint research paper, Woodpecker uses three separate AI models, apart from the MLLM being corrected for hallucinations, to perform hallucination correction.

These include GPT-3.5 turbo, Grounding DINO and BLIP-2-FlanT5. Together, these models work as evaluators to identify hallucinations and instruct the model being corrected to regenerate its output in accordance with its data.

In each of the above examples, an LLM hallucinates an incorrect answer (green background) to prompting (blue background). The corrected Woodpecker responses are shown with a red background. Source: Yin, et. al., 2023

To correct hallucinations, the AI models powering Woodpecker use a five-stage process that involves “key concept extraction, question formulation, visual knowledge validation, visual claim generation, and hallucination correction.”

The researchers claim these techniques provide additional transparency and “a 30.66%/24.33% improvement in accuracy over the baseline MiniGPT-4/mPLUG-Owl.” They evaluated numerous “off the shelf” MLLMs using their method and concluded that Woodpecker could be “easily integrated into other MLLMs.”

from Cointelegraph.com News Tristan Greene

Researchers in China developed a hallucination correction engine for AI models https://ift.tt/KkMYZXE

Post a Comment

0 Comments

Search This Blog

Report Abuse

About Me

Michael Saylor’s Strategy (MSTR) Buys 520 Bitcoin, Raises USD Reserve to $1.4 Billion https://ift.tt/egYZfzc

Facebook

Popular Posts

JPMorgan Chase CEO Jamie Dimon Declares War on Clarity Act, Calls Coinbase’s Armstrong ‘Full of Sh*t’ https://bitcoinmagazine.com/wp-content/uploads/2026/05/JPMorgan-Chase-CEO-Jamie-Dimon-Declares-War-on-Clarity-Act-Calls-Coinbases-Armstrong-‘Full-of-Sht.jpg

Retired Couple Loses $76,000 Life Savings to Bitcoin ATM Scam, Sues Bitcoin Depot in Federal Court https://ift.tt/qy5vjnw

Bukele’s Reform Makes El Salvador a Top Tax Haven: 0% on Foreign Income and Bitcoin Gains with Minimal Presence https://ift.tt/P5Knqep

Categories

Tags

Most Popular

Invite-Only Mita TechTalks 2026 to Unite Bitcoin, AI and Energy Leaders in Punta Mita https://ift.tt/GYbHMUT

Fed Signals Possible Rate Hikes as Kevin Warsh Opens ‘New Chapter’ at Central Bank https://ift.tt/ixLdIln

Cardone Capital’s Bitcoin-REIT Hybrid: Targeting 22-32% Returns by Blending Cash-Flowing Properties and BTC Holdings https://ift.tt/OerSnP4

Labels

Researchers in China developed a hallucination correction engine for AI models https://ift.tt/KkMYZXE

Post a Comment

0 Comments

Search This Blog

About Me

Social Plugin

Facebook

Popular Posts

Categories

Tags

Most Popular

Labels