Reconstructing Detailed Browsing Activities from Browser History

02/07/2021
by Geza Kovacs, et al.
Users' detailed browsing activity - such as which sites they spend time on and for how long, which tabs they have open, and which tab is focused at any given time - is useful for a number of research and practical applications. Gathering such data, however, requires that users install and use a monitoring tool over long periods of time. In contrast, browser extensions can gain instantaneous access to months of browser history data. However, the browser history is incomplete: it records only navigation events, missing important information such as time spent or which tab is focused. In this work, we aim to reconstruct time spent on sites using only users' browsing histories. We gathered three months of browsing history and two weeks of ground-truth detailed browsing activity from 185 participants. We developed a machine learning algorithm that predicts whether the browser window is focused and active at one-second granularity with an F1-score of 0.84. During periods when the browser is active, the algorithm can predict which domain the user was looking at with 76.2% accuracy. It can predict the total time spent online for each user with an R^2 value of 0.96, and the total time each user spent on each domain with an R^2 value of 0.92.
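To make the gap between navigation events and time spent concrete, the sketch below shows a naive heuristic baseline, not the machine learning algorithm described above. It assumes history entries are available as (timestamp in seconds, URL) pairs, credits the time until the next navigation event to the current domain, and caps each gap at an idle threshold. The function name `estimate_time_per_domain`, the 300-second cap, and the example URLs are all illustrative assumptions rather than details from the paper.

```python
from collections import defaultdict
from urllib.parse import urlparse

# Assumed idle cutoff in seconds; a hypothetical value, not taken from the paper.
IDLE_CAP_SECONDS = 300


def estimate_time_per_domain(visits, idle_cap=IDLE_CAP_SECONDS):
    """Naively estimate seconds spent per domain from navigation events.

    visits: iterable of (timestamp_seconds, url) pairs in any order.
    Each gap between consecutive events is credited to the earlier
    event's domain, capped at idle_cap. The final event has no
    successor, so it contributes no time.
    """
    ordered = sorted(visits)
    totals = defaultdict(float)
    for (t0, url), (t1, _next_url) in zip(ordered, ordered[1:]):
        domain = urlparse(url).netloc
        totals[domain] += min(t1 - t0, idle_cap)
    return dict(totals)


# Hypothetical history: four navigation events across three domains.
visits = [
    (0, "https://a.example/x"),
    (60, "https://b.example/"),
    (100, "https://a.example/y"),
    (1000, "https://c.example/"),
]
print(estimate_time_per_domain(visits))
```

A heuristic like this cannot tell whether the browser window was actually focused during a gap, which is the kind of information the abstract notes is missing from the history.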

research
04/04/2017

A History of Metaheuristics

This chapter describes the history of metaheuristics in five distinct pe...
research
02/18/2022

Simulating User-Level Twitter Activity with XGBoost and Probabilistic Hybrid Models

The Volume-Audience-Match simulator, or VAM was applied to predict futur...
research
11/29/2017

Real-Time System for Human Activity Analysis

We propose a real-time human activity analysis system, where a user's ac...
research
08/01/2022

Revisiting Information Cascades in Online Social Networks

It's by now folklore that to understand the activity pattern of a user i...
research
12/14/2018

Using Detailed Access Trajectories for Learning Behavior Analysis

Student learning activity in MOOCs can be viewed from multiple perspecti...
research
09/11/2019

Tracking the untracked

The issue of seamless identification of users previously tracked using e...
research
07/11/2020

MFED: A System for Monitoring Family Eating Dynamics

Obesity is a risk factor for many health issues, including heart disease...
