Webmunk

Webmunk is a web browser extension and data collection platform to conduct observational and experimental research studies on the Internet.

Webmunk is open-source and available to all interested researchers. You can access the code and documentation here. A researcher-oriented description of the Webmunk project and how to modify it for custom research studies is available in the white paper.

The primary creators of Webmunk are Chiara Farronato (Harvard Business School), Andrey Fradkin (Boston University), and Chris Karr (Audacious Software). Please contact Professors Farronato or Fradkin to discuss any research related aspects of Webmunk. Please contact Chris Karr at Audacious Software if you're in need of development assistance or consulting.



Functionalities

Tracking a user’s experience:

  • Web browsing history.
  • Website content.
  • Browser settings.
  • User interactions with websites (e.g. scrolling).
  • Cookies.

Modification of user experience:

  • Changing the look and feel of a website.
  • Inserting new content and new interactions into the website.
  • Removing and setting cookies.
  • Acting as a user proxy, by for example filling in forms and selecting options.
  • Reminding users to fill out surveys and conduct certain actions.

Screenshots

Flexible enrollments
Timed study tasks
Add and manipulate page elements

Details

The browser extension component of Webmunk is designed for the Chrome and Edge browsers. The extension can modify the content of web pages in real time and collect data as users browse the Internet. By keeping track of (anonymized) user identities, the extension can implement different web page manipulations across users or over time, which enables researchers to assign different experimental interventions that they may want to compare. The extension uses various techniques to engage study participants in specific tasks beyond their regular browsing activity, such as completing surveys or visiting designated websites.

The cloud server component of Webmunk encompasses two main elements. First, an enrollment server handles the generation of random user IDs and the allocation of experimental treatments and configurations across users and over time. Second, a data transmission server serves as the central hub for collecting both passive and experimental data, which can be stored and analyzed at a later stage. Both servers use the Passive Data Kit (PDK), a comprehensive data collection framework.

All components of Webmunk will be open-source and are designed to be easily extended. Additional functionality can be added to Webmunk through the use of modules. Modules are self-contained and re-usable components that work with Webmunk. In our projects, we've designed modules for identifying products on Amazon and for retrieving users' Amazon order histories. Modules are particularly useful for implementing manipulation and tracking that is particular to a web domain.

More information is available in the white paper.


© 2022 - 2024, The Fradkin Foundation and the President & Fellows of Harvard College