Weekly | Top 8 GitHub Repos | Week 32 - 2025
Noteworthy data-ops & analytics repos that first shipped less than a year ago.
#8. rohan-chandrashekar/5G-Network-Slicing
5G Network Slicing Simulation project designed to explore dynamic resource allocation and performance optimization across network slices, including eMBB, mMTC, and URLLC. Developed with modular Python architecture and detailed performance metrics.
Repo topic tags: 5g, 5g-simulation, network-analysis, network-slicing, python
Its up 1 new star on 2025-08-09 from 25 total stars the day before. It ranked at #3409 by new stars relative to its star count one day prior to 2025-08-09.
This repo was first pushed to Github on 2024-08-26. Its primary language is Python.
#7. thuhcsi/SpeechCraft
The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.
Its up 1 new star on 2025-08-09 and ranked at #3879 out of all github repos that first shipped less than a year ago.
This repo was first pushed to Github on 2024-08-21. Its primary language is Mixed/Unspecified.
#6. utkarsh9795/tomato_food_delivery_app
Tomato is a full-stack food delivery web application developed using the MERN (MongoDB, Express, React, Node.js) stack. It combines frontend, backend, and an admin panel, creating a streamlined experience for users, delivery personnel, and administrators alike.
Repo topic tags: full-stack-project, mern-stack-project, food-delivery-web-app, expressjs, jwt-authentication, mongodb, nodejs, reactjs
Its up 1 new star on 2025-08-09 and ranked at #2035 out of all github repos that first shipped less than a year ago.
This repo was first pushed to Github on 2024-11-01. Its primary language is JavaScript.
#5. stockeh/mlx-optimizers
A collection of optimizers for MLX
Repo topic tags: mlx, optimization
Its up 2 new stars on 2025-08-09 and ranked at #1157 out of all github repos that first shipped less than a year ago.
This repo was first pushed to Github on 2024-11-07. Its license was listed as: Apache License 2.0. Its primary language is Python.
#4. mhmdkardosha/CAT-Reloaded-2025-Data-Science-Roadmap
Roadmap for Data Science circle associated with CAT Reloaded.
Repo topic tags: data-cleaning, data-science, data-visualization, deep-learning, machine-learning, neural-networks, python, web-scraping-python
Its up 1 new star on 2025-08-09 from 37 total stars the day before. It ranked at #3858 by new stars relative to its star count one day prior to 2025-08-09.
This repo was first pushed to Github on 2024-11-24. Its primary language is Mixed/Unspecified.
#3. GeorgeHanyMilad/Data-Analysis-and-BI-Resources
Data Analysis and BI Resources 📊
Repo topic tags: business-intelligence, data-visualization, dataanalysis, database, excel, powerbi, python, sql, tableau
Its up 1 new star on 2025-08-09 from 41 total stars the day before. It ranked at #3993 by new stars relative to its star count one day prior to 2025-08-09.
This repo was first pushed to Github on 2024-08-27. Its primary language is Mixed/Unspecified.
#2. steineggerlab/folddisco
Fast indexing and search of discontinuous motifs in protein structures
Repo topic tags: bioinformatics, foldseek, motif, protein-structure
Its up 13 new stars this month from 33 total stars last month. It ranked at #1959 by new stars relative to its star count one month prior to 2025-08-09.
This repo was first pushed to Github on 2024-10-23. Its license was listed as: GNU General Public License v3.0. Its primary language is Rust/C++.
#1. tobilg/duckerd
CLI to create an ER Diagram from DuckDB database files
Repo topic tags: cli, database-diagram, duckdb, erd, erdiagram
Its up 1 new star on 2025-08-09 and ranked at #1805 out of all github repos that first shipped less than a year ago.
This repo was first pushed to Github on 2024-09-13. Its primary language is TypeScript.
Join the conversation: https://discord.gg/jCeSn3M7
Thank you for reading Decoded Data and being part of this journey of discovering great open-source projects.
In the coming weeks, I'll be upgrading the classification system from keyword matching to semantic labeling using text embeddings. This should significantly improve how repositories are categorized and ensure you see the most relevant projects for your interests.
Beyond the technical improvements, I want to invite current readers to join me in taking this platform to the next level. With this goal in mind, I've created a Discord server where we can come together to discuss the future of this project.
I've got a lot of different ideas for potential directions to take this project, and your input would be invaluable for prioritizing what's on my mind already. Better yet, if you have ideas to share then I would love to hear them out and put them on the roadmap.
If you just want to come along for the ride, that's OK, too! There will certainly be many interesting insights shared exclusively in the server as things progress. I'm still trying to determine what to make of this project, and your very presence helps me decide, be it vocal or voiceless!
Join the conversation: https://discord.gg/jCeSn3M7
Your presence makes DevInsight better - come be part of what's next!


