This repository contains web crawling and analysis code for the IMC 2024 paper "Browsing without Third-Party Cookies: What Do You See?".
Only Ubuntu 20.04 LTS is officially supported.
To create the cookie conda environment, execute:
conda env create -f environment.ymlActivate the environment with:
conda activate cookieTo start a crawl, execute:
python3 sbatch_main.py --jobs <number of slurm jobs>If you do not have Slurm, you can start a single job using main.py.
After crawling, use extract_differences.py to compute differences in extracted features.
To analyze the differences, use classification_algo.ipynb.