Weekly | Top 10 GitHub Repos | Week 46 - 2024
Noteworthy data-ops & analytics repos that first shipped in the past year.
#10. CosmosShadow/gptpdf
Using GPT to parse PDF
Its up 155 new stars this month and was created 143 days ago. It ranked at #512 by new stars relative to its age in days.
This repo was first pushed to Github on 2024-06-28. Its primary language is Python.
#9. observablehq/framework
A static site generator for data apps, dashboards, reports, and more. Observable Framework combines JavaScript on the front-end for interactive graphics with any language on the back-end for data analysis.
Repo topic tags: d3, dashboard, static-site-generator, visualization, framework, markdown
Its up 20 new stars this week and ranked at #1147 out of all github repos that first shipped more than a month ago but less than a year ago.
This repo was first pushed to Github on 2024-02-15. Its license was listed as: ISC License. Its primary language is TypeScript.
#8. SqueezeAILab/LLMCompiler
LLMCompiler: An LLM Compiler for Parallel Function Calling
Repo topic tags: function-calling, llm, llm-agent, llm-agents, llms, parallel-function-call, efficient-inference, large-language-models, llama, llama2, llm-framework, natural-language-processing, nlp, transformer
Its up 15 new stars this week and ranked at #1779 out of all github repos that first shipped more than a month ago but less than a year ago.
This repo was first pushed to Github on 2023-12-08. Its license was listed as: MIT License. Its primary language is Python.
#7. data-infra/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式
Its up 513 new stars this month and was created 139 days ago. It ranked at #112 by new stars relative to its age in days.
This repo was first pushed to Github on 2024-07-02. Its license was listed as: Other. Its primary language is Jupyter Notebook.
#6. Canner/WrenAI
WrenAI makes your database RAG-ready. Implement Text-to-SQL more accurately and securely.
Repo topic tags: bigquery, duckdb, llm, openai, postgresql, rag, text-to-sql, ai, sql, python, typescript, agent, fastapi, nextjs, gpt, nlp
Its up 23 new stars this week and ranked at #957 out of all github repos that first shipped more than a month ago but less than a year ago.
This repo was first pushed to Github on 2024-04-09. Its license was listed as: GNU Affero General Public License v3.0. Its primary language is TypeScript.
#5. DeepInsight-AI/DeepBI
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
Repo topic tags: csv, docker, gpt, gpt-4, mysql, postgresql, python38, redis, vue
Its up 197 new stars this month and ranked at #455 out of all github repos that first shipped more than a month ago but less than a year ago.
This repo was first pushed to Github on 2023-11-20. Its primary language is Mixed/Unspecified.
#4. frectonz/sql-studio
SQL Database Explorer [SQLite, libSQL, PostgreSQL, MySQL/MariaDB, DuckDB]
Repo topic tags: rust, sqlite, sqlite-browser, libsql, postgresql, mariadb, mysql
Its up 4 new stars on 2024-09-22 and was created 160 days ago. It ranked at #667 by new stars relative to its age in days.
This repo was first pushed to Github on 2024-06-11. Its license was listed as: MIT License. Its primary language is Rust,TypeScript.
#3. amazon-science/chronos-forecasting
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
Repo topic tags: forecasting, large-language-models, llm, machine-learning, time-series, foundation-models, pretrained-models, time-series-forecasting, timeseries, artificial-intelligence, huggingface, huggingface-transformers
Its up 105 new stars this month and ranked at #1042 out of all github repos that first shipped more than a month ago but less than a year ago.
This repo was first pushed to Github on 2024-03-13. Its license was listed as: Apache License 2.0. Its primary language is Python.
#2. decodingml/llm-twin-course
🤖 LLM Twin FREE Course: Building Your Production-Ready AI Replica | An End-to-End Framework for Production-Ready LLM Systems by Building Your LLM Twin | WIP...
Repo topic tags: aws, bytewax, comet-ml, generative-ai, large-language-models, machine-learning-engineering, ml-system-design, mlops, qdrant, qwak, superlinked
Its up 185 new stars this month and ranked at #492 out of all github repos that first shipped more than a month ago but less than a year ago.
This repo was first pushed to Github on 2024-03-08. Its license was listed as: MIT License. Its primary language is Mixed/Unspecified.
#1. briefercloud/briefer
Dashboards and notebooks in a single place. Create powerful and flexible dashboards using code, or build beautiful Notion-like notebooks and share them with your team.
Repo topic tags: analytics, bi, bigquery, briefer, business-intelligence, businessintelligence, dashboard, data-analysis, data-visualization, jupyter, notebook, postgres, postgresql, reporting, visualization
Its up 2820 new stars this month and was created 70 days ago. It ranked at #10 by new stars relative to its age in days.
This repo was first pushed to Github on 2024-09-09. Its license was listed as: GNU Affero General Public License v3.0. Its primary language is TypeScript.