Detect
Develop
Design
Detect Develop Design
The Problem
Have you ever searched for information online and found very little, or what you did find seemed questionable?
These gaps are called DATA VOIDS. Data voids occur when there is a lack of reliable information on certain topics online. These gaps often appear around new, niche, or rapidly changing subjects where trustworthy content hasn't yet been established. In these voids, search engines struggle to provide accurate results because there simply isn't enough reliable information available. This creates a vulnerable spot that can be easily exploited by conspiracy theorists and propagandists.
These actors exploit data voids by generating misleading or false content to fill these gaps. They manipulate search engine algorithms to make their content appear more prominently, leading people to encounter falsehoods instead of facts. This content is often sensational or emotionally charged, making it more likely to be shared and believed.
The effects of this manipulation can be profound. When misinformation fills these data voids, it shapes public opinion and influences decision-making in harmful ways. This is particularly concerning in areas like public health, financial markets, and democratic processes. False information can lead to dangerous myths about critical issues, resulting in harmful behaviors. It can cause unnecessary panic or false optimism, leading to poor decisions and harmful outcomes for the public. Misinformation undermines the public’s trust in important institutions and processes, eroding confidence in systems and institutions, causing public confusion.
The lack of timely and reliable information in these data voids allows harmful content to gain traction and influence people before effective countermeasures can be implemented. This makes it harder to correct misconceptions and restore accurate information, highlighting the importance of addressing data voids to ensure access to trustworthy information on important topics.
Our Approach
At the forefront of our efforts to combat misinformation are the innovative ways we are leveraging Wikipedia.
At the heart of our mission to combat misinformation lies our innovative use of Wikipedia—one of the world’s most visited information sources and a frequent first stop in search results. Given its prominence, Wikipedia plays a crucial role in the battle for accurate information.
Orphan Articles: These are lesser-known articles on Wikipedia that receive fewer edits and links from other pages. By analyzing these articles, we aim to detect early signs of misinformation and prevent it from spreading.
Phrase Frequencies: We track how often specific terms and phrases appear in Wikipedia articles. Unusual spikes or patterns can indicate emerging data voids, allowing us to identify and address misinformation quickly.
References Cited: Reliable Wikipedia articles are supported by credible sources. By evaluating the quality and frequency of citations, we pinpoint areas that may lack trustworthy references and take action to prevent misinformation from taking root.
We call this the "dark matter of Wikipedia," referring to the less visible but critical parts of the site. By understanding the relationship between orphan articles, phrase frequencies, and references, we aim to reveal how data voids form and evolve. Our approach harnesses Wikipedia's vast resources and advanced data science to detect and fill data voids with reliable information. By staying ahead of misinformation, we protect public knowledge and ensure access to trustworthy information on critical topics.