Happy New Year 2024! For the first post of the new year, just as I have done before, I’m curious about which Python projects are the most popular so far. GitHub is the most suitable place to get these statistics. Not every open-source project is hosted there, but no other single place offers better coverage.
This ranking is easy to reproduce because I’ll share my code. Let’s first look at how to get the ranked list from the GitHub API with a few lines of code. After that, I’ll categorize these projects using my own terminology and add a short introduction to each.
The Top 30 GitHub projects are categorized as follows:
- 2 repositories: Machine Learning Frameworks
- 3 repositories: AI-driven Applications
- 8 repositories: Software Development Frameworks
- 2 repositories: Development Productivity Tools
- 3 repositories: Useful Information Catalog
- 8 repositories: Educative Content
- 4 repositories: Real-World Applications
The official API documentation can be found on this page:
https://docs.github.com/en/rest/reference/search#search-repositories
So, I won’t repeat the details of the API endpoint, such as its parameters, in this article. If you are interested in what else the GitHub API can do, please refer to that page.
The best part is that we don’t need to register or apply for an API key to use this endpoint. It does have a rate limit of up to 10 requests per minute for unauthenticated calls, but that is not a problem because we only want a few top-ranked repos. A handful of API calls is enough for debugging.
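To make the endpoint a bit more concrete, here is a minimal sketch of how a search request could be put together. It uses only the standard library so that it runs without installing anything. The helper names (build_search_url, seconds_between_calls, fetch_top_repos) are my own illustration and not part of the GitHub API, and querying language:python sorted by stars is an assumption about what "most popular" means here.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

SEARCH_URL = "https://api.github.com/search/repositories"


def build_search_url(language="python", per_page=30, page=1):
    """Build the search URL for the most-starred repositories of a language."""
    params = {
        "q": f"language:{language}",  # search qualifier (assumed ranking scope)
        "sort": "stars",              # rank by star count
        "order": "desc",              # highest first
        "per_page": per_page,         # the API allows at most 100 per page
        "page": page,
    }
    return f"{SEARCH_URL}?{urlencode(params)}"


def seconds_between_calls(limit_per_minute=10):
    """Spacing between calls that stays within a per-minute rate limit."""
    return 60 / limit_per_minute


def fetch_top_repos(language="python", per_page=30):
    """Call the endpoint without an API key; return (full_name, stars) pairs."""
    request = Request(
        build_search_url(language, per_page),
        headers={"Accept": "application/vnd.github+json"},
    )
    with urlopen(request, timeout=10) as response:
        payload = json.load(response)
    return [(repo["full_name"], repo["stargazers_count"]) for repo in payload["items"]]


# Only builds the URL; no network request is made here.
print(build_search_url())
```

Calling fetch_top_repos() performs a real network request, so it counts against the rate limit. Waiting seconds_between_calls() seconds between calls keeps you within 10 requests per minute.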
First of all, we need to use the requests module of Python. It is not built-in, so it has to be installed first (for example with pip install requests), but I believe most of you should be familiar with…