Web Scraping and Its Potential for Government and Beyond
October 2022

Photo Credit: Ryan Hatano, USAID LO-MTSR Project Photo

With increasing focus on how local organizations contribute to USAID-funded activities, it is important to look at new ways of applying known technologies. Web scraping is one such technology: it refers simply to extracting data from a website, collecting it, and exporting it into a spreadsheet, API, or other format suited to a particular purpose or application. On a project in Cambodia, The Cloudburst Group demonstrated the usefulness of web scraping for gathering evidence on the effectiveness of USAID capacity building to strengthen local organizations.
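
To make the extract, collect, and export steps concrete, here is a minimal sketch using only the Python standard library. It is an illustration, not the project's code: the sample HTML, the `LinkExtractor` class, and the `scrape_links_to_csv` function are all hypothetical, and a real scraper would first download the page and would need to respect the site's terms of use.

```python
import csv
import io
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects (link text, href) pairs from anchor tags that have an href."""

    def __init__(self):
        super().__init__()
        self.rows = []      # collected (text, href) pairs
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            # Anchors without an href leave _href as None and are skipped.
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.rows.append(("".join(self._text).strip(), self._href))
            self._href = None


def scrape_links_to_csv(html):
    """Extract links from an HTML string and return them as CSV text."""
    parser = LinkExtractor()
    parser.feed(html)   # extract
    parser.close()
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["text", "href"])
    writer.writerows(parser.rows)   # export the collected rows
    return out.getvalue()


# Hypothetical page content standing in for a downloaded web page.
SAMPLE_HTML = """
<ul>
  <li><a href="https://example.org/about">About us</a></li>
  <li><a href="https://example.org/news">News</a></li>
</ul>
"""

csv_text = scrape_links_to_csv(SAMPLE_HTML)
print(csv_text)
```

The resulting CSV text can be written to a file and opened directly as a spreadsheet.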

USAID’s Cambodia Small Business Applied Research (SBAR) Award: Local Organizations – Movement Towards Self-Reliance (LO-MTSR) Activity, implemented by The Cloudburst Group, PartnersGlobal, and Devlab@Penn (formerly Duke), designed a first-of-its-kind experimental evaluation to measure the scalability of an existing intervention aimed at increasing organizational resiliency to closing civic spaces. The team did this by deepening the connectivity of civil society organizations (CSOs) to their communities, other CSOs, and potential funders. The program evaluated a series of capacity-building activities and trainings on topics including financial diversification, social media and marketing, well-being in the workplace, and contingency planning.

As one component of the program, CSOs throughout Cambodia working across health, education, food security, environment, and democracy, rights, and governance received coaching on how increasing social network connections, including with global and local donors, could diversify revenue and influence sustainability. Organizational-level coaching covered entrepreneurial strategies for increasing network connections by using social media for promotion, marketing, and networking. To measure the impact of enhanced network connections within a social network analysis framework, the evaluation team used social media data collection methods to gather and analyze information on CSOs’ web presence. The team developed Python scripts that mined a large volume of social media data, mostly from Facebook, to gauge changes in public engagement, measured by the number of likes, comments, and shares.
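
As a rough illustration of how engagement measured by likes, comments, and shares might be tallied, the sketch below sums the three counts per organization per month. The records, field names, and the `monthly_engagement` function are hypothetical, and summing the three counts with equal weight is an assumption; the article does not describe how the team's scripts combined them.

```python
from collections import defaultdict
from datetime import date

# Hypothetical post records; the field names are assumptions for illustration.
POSTS = [
    {"org": "CSO A", "date": date(2021, 3, 2), "likes": 12, "comments": 3, "shares": 1},
    {"org": "CSO A", "date": date(2021, 3, 20), "likes": 5, "comments": 0, "shares": 2},
    {"org": "CSO A", "date": date(2021, 4, 1), "likes": 30, "comments": 4, "shares": 6},
    {"org": "CSO B", "date": date(2021, 3, 15), "likes": 7, "comments": 1, "shares": 0},
]


def monthly_engagement(posts):
    """Sum likes + comments + shares per (organization, 'YYYY-MM') pair."""
    totals = defaultdict(int)
    for post in posts:
        key = (post["org"], post["date"].strftime("%Y-%m"))
        totals[key] += post["likes"] + post["comments"] + post["shares"]
    return dict(totals)


engagement = monthly_engagement(POSTS)
for (org, month), total in sorted(engagement.items()):
    print(org, month, total)
```

Grouping by month makes it possible to compare an organization's engagement before and after a given date, such as a training session.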

The team cleaned and analyzed the more than 20,000 social media records mined and, within a social network analysis framework, measured indicators of each organization’s connectedness, including with global and local donors. This approach allowed the team to compile a detailed account of each organization’s social media presence throughout the three-year lifespan of the project and to gauge how that presence responded to external events (COVID-19 lockdowns, elections, major public holidays, etc.) and to the project’s social media trainings.
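
The article does not say which connectedness indicators the team computed. One common indicator in social network analysis is degree centrality: the share of all other actors in the network to which an actor is directly connected. The sketch below computes it for a small, hypothetical set of connections; the organization names, the `EDGES` list, and the `degree_centrality` function are assumptions for illustration only.

```python
from collections import defaultdict

# Hypothetical connections (for example, one page following or tagging another).
EDGES = [
    ("CSO A", "Donor X"),
    ("CSO A", "CSO B"),
    ("CSO B", "Donor X"),
    ("CSO C", "CSO A"),
]


def degree_centrality(edges):
    """Return each node's number of distinct neighbors divided by (n - 1)."""
    neighbors = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-connections
            neighbors[u].add(v)
            neighbors[v].add(u)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}


centrality = degree_centrality(EDGES)
for node, score in sorted(centrality.items()):
    print(f"{node}: {score:.2f}")
```

In this toy network, CSO A is directly connected to every other actor and so receives the highest possible score, while CSO C, with a single connection, receives the lowest.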

Though organizations learned new skills that may ultimately improve their resiliency, the findings indicate that the social media training had neither a positive nor a negative effect on network connectedness. These findings offered novel insights into the challenges of designing and implementing effective social media capacity-building activities. Results provided to USAID highlighted opportunities and challenges in supporting a vibrant and resilient civil society able to participate in and reap the benefits of future USAID funding.

Web Scraping as a Tool for the Future…

There is great potential for applying this type of data collection in other ways. For example, social media scraping could be used to identify patterns in the spread of dis- and misinformation. Understanding these patterns and the context in which such information spreads may help USAID and others identify opportunities for programming to combat it. Social media scraping has also been used to provide real-time information for monitoring a variety of programs, allowing those programs to adapt and respond to changing conditions. Other opportunities include identifying mechanisms to support networks of activists and human rights defenders. While social media is often used as a means to identify and prosecute these vulnerable groups, strong social networks also contribute to knowledge sharing and improved security for them.

Photo Credit: Ryan Hatano, USAID LO-MTSR Project Photo

Small Business Association for International Companies is a membership organization established to promote the meaningful utilization of U.S. small businesses at U.S. government agencies providing foreign assistance.
2001 L Street, NW Suite 500, Washington, D.C. 20036. Phone (310) 242-3030
© Copyright 2023 Small Business Association for International Companies