Analyzing-Wikipedia-Clickstreams-with-PySpark-Project

Public

This project analyzes Wikipedia's clickstream data to uncover patterns in how users navigate from one article to another. It uses Apache Spark through its Python API, PySpark, for data manipulation and analysis, and aims to provide insights into user behavior on Wikipedia, including the most popular pathways to specific articles.
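
As a rough illustration of what "the most popular pathways to a specific article" means, the aggregation can be sketched in plain Python. This is not the project's code: the row layout `(prev, curr, type, n)` is an assumption about the clickstream format, the sample rows are invented for illustration, and `top_referrers` is a hypothetical helper name. On real data the same filter, group, and sum steps would be expressed with Spark DataFrame operations.

```python
from collections import Counter

# Illustrative rows only (not real Wikipedia data), in the assumed
# clickstream layout: (prev, curr, type, n), where n is a click count.
SAMPLE_ROWS = [
    ("other-search", "Apache_Spark", "external", 120),
    ("Hadoop", "Apache_Spark", "link", 45),
    ("MapReduce", "Apache_Spark", "link", 30),
    ("Hadoop", "Apache_Spark", "link", 5),
    ("Apache_Spark", "Hadoop", "link", 12),
]


def top_referrers(rows, target, k=3):
    """Return the k sources that send the most clicks to `target`."""
    totals = Counter()
    for prev, curr, _link_type, n in rows:
        # Keep only rows whose destination is the article of interest,
        # then sum click counts per source article.
        if curr == target:
            totals[prev] += n
    # most_common sorts by total clicks, highest first.
    return totals.most_common(k)


if __name__ == "__main__":
    for source, clicks in top_referrers(SAMPLE_ROWS, "Apache_Spark"):
        print(source, clicks)
```

Summing per source before ranking matters because the same source can appear in several rows; ranking raw rows would undercount it.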

Created: 2024-02-16T00:00:56
Updated: 2025-06-24T15:03:23
Stars: 2
Stars increase: 0