Weekly | Top 10 GitHub Repos | Week 38 - 2025
Noteworthy data-ops & analytics repos that first shipped less than a year ago.
#10. Z-Gort/Postgres-VectorDB-GUI-Analytics
This repo was first pushed to Github on 2024-12-18. Its primary language is Python.
#9. Zephyruso/zashboard
A dashboard using clash api
This repo was first pushed to Github on 2024-11-13. Its primary language is Vue/TypeScript.
#8. alibaba/Tora
The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
This repo was first pushed to Github on 2024-10-14. Its license was listed as: Apache License 2.0. Its primary language is Python.
#7. oop7/YTSage
Modern YouTube downloader with a clean PyQt6 interface. Download videos in any quality, extract audio, fetch subtitles (including auto-generated), and view video metadata. Built with yt-dlp for reliable performance.
This repo was first pushed to Github on 2024-11-29. Its license was listed as: MIT License. Its primary language is Python.
#6. google/langextract
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
["llm", "nlp", "python", "gemini-ai", "information-extration", "large-language-models", "structured-data", "gemini", "gemini-api", "gemini-flash", "gemini-pro"]
This repo was first pushed to Github on 2025-07-09. Its license was listed as: Apache License 2.0. Its primary language is Python.
#5. oxylabs/free-proxy-list
Claim Free proxy list with United States IP addresses and use it for your projects.
["free-proxies", "free-proxies-for-web-scraping", "free-proxy", "free-proxy-ip", "free-proxy-list", "proxies", "proxies-http", "proxies-https", "proxies-list", "proxies-socks5", "proxy", "proxy-list", "proxy-server", "proxypool", "web-scraping"]
This repo was first pushed to Github on 2025-02-06. Its primary language is Mixed/Unspecified.
#4. fast-excel/fesod
Fast. Easy. Done. Processing Excels without worrying about large files causing OOM.
This repo was first pushed to Github on 2024-10-05. Its license was listed as: Apache License 2.0. Its primary language is Java.
#3. autoscrape-labs/pydoll
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions. It supports Python's asynchronous features, enhancing performance and enabling event capturing and simultaneous web scraping.
This repo was first pushed to Github on 2024-10-27. Its license was listed as: MIT License. Its primary language is Python.
#2. TauricResearch/TradingAgents
TradingAgents: Multi-Agents LLM Financial Trading Framework
This repo was first pushed to Github on 2024-12-28. Its primary language is JavaScript/HTML.
#1. documentdb/documentdb
MongoDB-compatible database engine for cloud-native and open-source workloads. Built for scalability, performance, and developer productivity.
This repo was first pushed to Github on 2025-01-23. Its license was listed as: MIT License. Its primary language is C.



