TopicTrekker: GitHub Repository Scraper

Successfully scraped data from GitHub, a renowned platform for developers, and extracted information about the top repositories within various GitHub topics.

About Project

Our goal is to scrap data about the top repositories within various GitHub Topics section and create a structured dataset containing repository details, such as repository name, owner username, star count, and repository URL etc.

This project serves as an exploration of web scraping techniques and data extraction from dynamic web pages.

Project Stack

  • Programming Language: Python

  • Libraries: requests, Beautiful Soup, Pandas

  • Development Environment: Jupyter Notebook

This project serves as an exploration of web scraping techniques and data extraction from dynamic web pages.

Project Stack

  • Programming Language: Python

  • Libraries: requests, Beautiful Soup, Pandas

  • Development Environment: Jupyter Notebook

This project serves as an exploration of web scraping techniques and data extraction from dynamic web pages.

Project Stack

  • Programming Language: Python

  • Libraries: requests, Beautiful Soup, Pandas

  • Development Environment: Jupyter Notebook

This project serves as an exploration of web scraping techniques and data extraction from dynamic web pages.

Project Stack

  • Programming Language: Python

  • Libraries: requests, Beautiful Soup, Pandas

  • Development Environment: Jupyter Notebook

"Extracted valuable data from dynamic websites. We organized this data into structured datasets and gained insights into user engagement on GitHub. Detailed analysis and code can be accessed in a Jupyter Notebook"

Waqas Ahmad – Data Analyst

Let's collaborate and bring your vision to life!

©2023 Designed by Waqas Ahmad

Going up?

Let's collaborate and bring your vision to life!

©2023 Designed by Waqas Ahmad

Going up?