The Times Australia

Matching tweets to ZIP codes can spotlight hot spots of COVID-19 vaccine hesitancy

Written by Mayank Kejriwal, Research Assistant Professor of Industrial & Systems Engineering, University of Southern California

Public health officials are focusing on the 30% of the eligible population that remains unvaccinated[1] against COVID-19 as of the end of October 2021, and that requires figuring out where those people are and why they are unvaccinated.

People remain unvaccinated for many reasons[2], including belief in unfounded conspiracy theories about the disease, the vaccines or both; distrust of the medical establishment; concerns about risks and side effects; fear of needles; and difficulty accessing vaccines. To target their messaging and outreach geographically and according to the type of hesitancy, public health officials need good data to guide their efforts. Traditional survey methods[3] are helpful but tend to be expensive[4].

Another approach is to assess vaccine hesitancy through the lens of social media. As an artificial intelligence researcher[5], I analyze social media data using machine learning. My latest research[6], conducted with graduate student Sara Melotte and accepted for publication in the journal PLOS Digital Health, predicts the degree of vaccine hesitancy at the ZIP code level in U.S. metropolitan areas by analyzing geo-located tweets.

We found that processing geo-located Twitter data with readily available machine learning techniques predicted vaccine hesitancy by ZIP code more accurately than using attributes of ZIP codes such as average home price and the number of health care and social services facilities.

The limits of surveys

Surveys, such as a Gallup COVID-19 survey[7] launched in 2020, estimate vaccine hesitancy levels in the general population by polling a representative sample with a Yes/No vaccine hesitancy question: “If a Food and Drug Administration-approved vaccine to prevent coronavirus/COVID-19 was available right now at no cost, would you agree to be vaccinated?” The estimated vaccine hesitancy is the percentage of individuals who respond “No.” As demonstrated both in our research[8] and work by others[9], factors such as location, income and education levels all correlate with vaccine hesitancy.
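
The estimate described above is a simple proportion. A minimal sketch, assuming unweighted responses (real surveys typically weight their samples) and an invented list of answers; the function name `hesitancy_rate` and the choice to skip blank answers are assumptions made for illustration, not part of the Gallup methodology:

```python
def hesitancy_rate(responses):
    """Return the percentage of 'No' answers among valid Yes/No responses."""
    # Normalize the answers and drop blanks (skipping blanks is an assumption).
    answers = [r.strip().lower() for r in responses if r and r.strip()]
    valid = [a for a in answers if a in ("yes", "no")]
    if not valid:
        raise ValueError("no valid Yes/No responses")
    return 100.0 * valid.count("no") / len(valid)


# Made-up responses, for illustration only.
sample = ["Yes", "No", "yes", "No", "Yes", "", "Yes", "no", "Yes", "Yes"]
rate = hesitancy_rate(sample)
print(f"Estimated hesitancy: {rate:.1f}%")
```

Because the estimate is a ratio over however many people answered, small samples and non-response translate directly into noisier numbers.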

A general disadvantage of such surveys is that detailed questions are expensive to administer[10]. Sample sizes tend to be small because of cost constraints and non-response. Non-response has been exacerbated recently[11] by political polarization. Computational social science[12] methods, which use computer algorithms to analyze large amounts of data, are another option, but they can have trouble extracting insights from noisy social media text.

Mining Twitter

Our work takes on the challenge of using publicly available Twitter data to accurately predict vaccine hesitancy in a given ZIP code. We focused on ZIP codes in major metropolitan areas, which are known for high tweeting activity[13]. Users also enable GPS more often in these areas.

The Twitter smartphone app includes an option for users to tag their tweets with their precise GPS location. Screenshot by The Conversation U.S., CC BY-ND[14]

As a first step, we downloaded all the tweets from a publicly available dataset called GeoCoV19[15], which filters tweets to be as relevant to COVID-19 as possible. Next, using peer-reviewed methodology[16], we filtered the tweets down to GPS-enabled tweets from the top metropolitan areas. We then randomly split the tweets into a training set and a test set. The former was used to develop the model, while the latter was used to evaluate the model.
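
The filtering and splitting steps above can be sketched in a few lines. This is an illustration under stated assumptions, not the study's pipeline: the tweet layout (a dict with a `coords` field), the bounding box values, the 80/20 split ratio and the helper names `in_box` and `train_test_split` are all hypothetical, since the article does not specify them.

```python
import random

# Hypothetical bounding box (min_lat, max_lat, min_lon, max_lon); values are illustrative.
METRO_BOX = (33.7, 34.4, -118.7, -117.6)


def in_box(tweet, box):
    """True if the tweet carries GPS coordinates that fall inside the box."""
    coords = tweet.get("coords")
    if coords is None:  # tweet was not GPS-enabled
        return False
    lat, lon = coords
    min_lat, max_lat, min_lon, max_lon = box
    return min_lat <= lat <= max_lat and min_lon <= lon <= max_lon


def train_test_split(items, test_fraction=0.2, seed=0):
    """Shuffle a copy of the items and split it into (train, test)."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split reproducible
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Made-up tweets: ten inside the box, one without GPS, one far outside.
tweets = [{"id": i, "coords": (34.0 + 0.01 * i, -118.2)} for i in range(10)]
tweets.append({"id": 10, "coords": None})
tweets.append({"id": 11, "coords": (40.7, -74.0)})

geo_tweets = [t for t in tweets if in_box(t, METRO_BOX)]
train, test = train_test_split(geo_tweets)
print(len(geo_tweets), len(train), len(test))
```

Keeping the test set untouched during model development is what makes the later evaluation a fair estimate of how the model behaves on unseen tweets.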

Training a model to predict the vaccine hesitancy of a ZIP code is like drawing a straight line through a set of points so that the line comes as close as possible to the center of the points, known as a line of best fit[17]. The line indicates the trend in the data. The first step is converting the raw text of tweets into data points.
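
To make the line-of-best-fit analogy concrete, here is a minimal sketch of an ordinary least-squares fit with a single input variable. It assumes the usual least-squares definition of "best fit"; the model in the study works with many-dimensional data points derived from tweets, so this is only the one-dimensional version of the idea, and the sample numbers are invented.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need at least two (x, y) pairs of equal length")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Spread of x around its mean, and how x and y vary together.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    if sxx == 0:
        raise ValueError("all x values are identical; the slope is undefined")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Invented points that roughly follow an upward trend.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.1]
slope, intercept = fit_line(xs, ys)
print(f"y = {slope:.2f} * x + {intercept:.2f}")
```

Once the slope and intercept are known, predicting a new value is just a matter of plugging a new x into the line.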

Recently developed deep neural networks[19] are able to automatically convert the text into data points so that tweets with similar meanings are closer together. We essentially used such a network to convert our tweets to data points and then trained our machine learning model on those data points. We validated our model using the Gallup COVID-19 survey results.
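
The text-to-data-points step can be illustrated with a toy stand-in. The sketch below does not use a neural network: `toy_embed` is a hypothetical hashed bag-of-words encoder that only mimics the shape of the output (a fixed-length vector per tweet) and does not capture meaning the way a trained network does. Averaging tweet vectors within each ZIP code is likewise an assumption about how tweet-level vectors could become ZIP-level inputs, and the ZIP codes and tweet texts are invented.

```python
import zlib
from collections import defaultdict

DIM = 8  # toy vector length; real text encoders use far more dimensions


def toy_embed(text, dim=DIM):
    """Stand-in for a neural text encoder: hashed bag-of-words, L2-normalized."""
    vec = [0.0] * dim
    for word in text.lower().split():
        # crc32 gives a hash that is stable across runs, unlike built-in hash().
        vec[zlib.crc32(word.encode("utf-8")) % dim] += 1.0
    norm = sum(v * v for v in vec) ** 0.5
    return [v / norm for v in vec] if norm else vec


def zip_features(tweets):
    """Average the tweet vectors within each ZIP code (an assumed aggregation)."""
    grouped = defaultdict(list)
    for zip_code, text in tweets:
        grouped[zip_code].append(toy_embed(text))
    return {
        zip_code: [sum(column) / len(vectors) for column in zip(*vectors)]
        for zip_code, vectors in grouped.items()
    }


# Invented (ZIP code, tweet text) pairs, for illustration only.
tweets = [
    ("90001", "not sure about the vaccine side effects"),
    ("90001", "got my shot today"),
    ("90002", "vaccine appointment booked"),
]
features = zip_features(tweets)
print({zip_code: len(vector) for zip_code, vector in features.items()})
```

Each ZIP code ends up with one fixed-length vector, which is the kind of input a regression model can be trained on against survey-based hesitancy values.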

Our method performed better at predicting high levels of vaccine hesitancy than methods that use only generic features, such as average home prices within the ZIP code[20], and no social media data. We also showed our model to be effective in the presence of tweets that aren’t related to vaccines or COVID-19. The GeoCoV19 dataset is good but includes many tweets that are not relevant specifically to vaccines and a small but non-trivial fraction that are not relevant to COVID-19 at all.

Early detection and prevention

In research currently undergoing peer review, we developed algorithms that automatically mine potential causes of vaccine hesitancy, and their extent, from social media. Our preliminary analysis confirms that while some causes are the result of conspiracy theories[21] and misinformation, others are informed by legitimate concerns such as potential vaccine side effects.

We expect that people with these concerns may be much more amenable to getting vaccinated if they are presented with reliable sources of information[22] that assuage their fears. In the future, public health officials could use machine learning for early detection of vaccine hesitancy on social media. Then they could use algorithms to automatically distribute targeted information and go on the offense against the spread of health-related misinformation[23].

Such future digital public health systems could lead to healthier outcomes in both the physical and digital realms.

References

  1. ^ 30% of the eligible population that remains unvaccinated (covid.cdc.gov)
  2. ^ remain unvaccinated for many reasons (www.bbc.com)
  3. ^ Traditional survey methods (dx.doi.org)
  4. ^ tend to be expensive (dx.doi.org)
  5. ^ artificial intelligence researcher (scholar.google.com)
  6. ^ My latest research (arxiv.org)
  7. ^ Gallup COVID-19 survey (news.gallup.com)
  8. ^ our research (doi.org)
  9. ^ work by others (doi.org)
  10. ^ expensive to administer (dx.doi.org)
  11. ^ exacerbated recently (doi.org)
  12. ^ Computational social science (doi.org)
  13. ^ high tweeting activity (ojs.aaai.org)
  14. ^ CC BY-ND (creativecommons.org)
  15. ^ GeoCoV19 (arxiv.org)
  16. ^ peer-reviewed methodology (doi.org)
  17. ^ line of best fit (www.statisticshowto.com)
  18. ^ Weekly on Wednesdays (theconversation.com)
  19. ^ Recently developed deep neural networks (www.aclweb.org)
  20. ^ average home prices within the ZIP code (www.zillow.com)
  21. ^ result of conspiracy theories (doi.org)
  22. ^ reliable sources of information (www.mayoclinic.org)
  23. ^ health-related misinformation (doi.org)

Read more https://theconversation.com/matching-tweets-to-zip-codes-can-spotlight-hot-spots-of-covid-19-vaccine-hesitancy-169596
