Top 10 Trending GitHub Repositories, January 2025
Welcome to our weekly roundup of the Top 10 Trending GitHub Repositories for the week of January 20, 2025. These projects have gained significant attention and are worth exploring as we kick off the new year. Let’s dive in!
1. OpenBMB / MiniCPM-o
Description: MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech, and Multimodal Live Streaming on Your Phone.
Link to Repository: Visit Repository
OpenBMB / MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
中文 | English
WeChat | DiscordMiniCPM-o 2.6 🤗 🤖 | MiniCPM-V 2.6 🤗 🤖 | Technical Blog Coming Soon
MiniCPM-o is the latest series of end-side multimodal LLMs (MLLMs) ungraded from MiniCPM-V. The models can now take image, video, text, and audio as inputs and provide high-quality text and speech outputs in an end-to-end fashion. Since February 2024, we have released 6 versions of the model, aiming to achieve strong performance and efficient deployment. The most notable models in the series currently include:
-
MiniCPM-o 2.6: 🔥🔥🔥 The latest and most capable model in the MiniCPM-o series. With a total of 8B parameters, this end-to-end model achieves comparable performance to GPT-4o-202405 in vision, speech, and multimodal live streaming, making it one of the most versatile and performant models in the open-source community. For…
2. TabbyML / Tabby
Description: Self-hosted AI coding assistant.
Link to Repository: Visit Repository
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features:
- Self-contained, with no need for a DBMS or cloud service.
- OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE).
- Supports consumer-grade GPUs.
🔥 What's New
- 12/06/2024 Llamafile deployment integration and enhanced Answer Engine user experience are coming in Tabby v0.21.0!🚀
- 11/10/2024 Switching between different backend chat models is supported in Answer Engine with Tabby v0.20.0!
- 10/30/2024 Tabby v0.19.0 featuring recent shared threads on the main page to improve their discoverability.
Archived
- 07/09/2024 🎉Announce Codestral integration in Tabby!
- 07/05/2024 Tabby v0.13.0 introduces Answer Engine, a central knowledge engine for internal engineering teams. It seamlessly integrates with dev team's internal data, delivering reliable and precise answers to empower developers.
- 06/13/2024 VSCode 1.7 marks a significant…
3. FujiwaraChoki / MoneyPrinterV2
Description: Automate the process of making money online.
Link to Repository: Visit Repository
FujiwaraChoki / MoneyPrinterV2
Automate the process of making money online.
MoneyPrinter V2
An Application that automates the process of making money online MPV2 (MoneyPrinter Version 2) is, as the name suggests, the second version of the MoneyPrinter project. It is a complete rewrite of the original project, with a focus on a wider range of features and a more modular architecture.
Note: MPV2 needs Python 3.9 to function effectively Watch the YouTube video here
Features
-
Twitter Bot (with CRON Jobs =>
scheduler
) -
YouTube Shorts Automater (with CRON Jobs =>
scheduler
) - Affiliate Marketing (Amazon + Twitter)
- Find local businesses & cold outreach
Versions
MoneyPrinter has different versions for multiple languages developed by the community for the community. Here are some known versions:
- Chinese: MoneyPrinterTurbo
If you would like to submit your own version/fork of MoneyPrinter, please open an issue describing the changes you made to the fork.
Installation
Please install Microsoft Visual C++ build tools first, so that CoquiTTS…
4. KoljaB / RealtimeSTT
Description: A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation, and instant transcription.
Link to Repository: Visit Repository
KoljaB / RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
RealtimeSTT
Easy-to-use, low-latency speech-to-text library for realtime applications
New
- AudioToTextRecorderClient class, which automatically starts a server if none is running and connects to it. The class shares the same interface as AudioToTextRecorder, making it easy to upgrade or switch between the two. (Work in progress, most parameters and callbacks of AudioToTextRecorder are already implemented into AudioToTextRecorderClient, but not all. Also the server can not handle concurrent (parallel) requests yet.)
- reworked CLI interface ("stt-server" to start the server, "stt" to start the client, look at "server" folder for more info)
About the Project
RealtimeSTT listens to the microphone and transcribes voice into text.
Hint: Check out Linguflex, the original project from which RealtimeSTT is spun off. It lets you control your environment by speaking and is one of the most capable and sophisticated open-source assistants currently available.
It's ideal for:
- Voice Assistants
- Applications requiring fast and precise speech-to-text conversion
RealtimeSTT.Demo.video.mp4
5. Harry0703 / MoneyPrinterTurbo
Description: Generate high-quality short videos with one click using AI LLMs.
Link to Repository: Visit Repository
harry0703 / MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
MoneyPrinterTurbo 💸
简体中文 | English
只需提供一个视频 主题 或 关键词 ,就可以全自动生成视频文案、视频素材、视频字幕、视频背景音乐,然后合成一个高清的短视频。
Web界面
API界面
特别感谢 🙏
由于该项目的 部署 和 使用,对于一些小白用户来说,还是 有一定的门槛,在此特别感谢
录咖(AI智能 多媒体服务平台) 网站基于该项目,提供的免费AI视频生成器
服务,可以不用部署,直接在线使用,非常方便。
感谢赞助 🙏
感谢佐糖 https://picwish.cn 对该项目的支持和赞助,使得该项目能够持续的更新和维护。
佐糖专注于图像处理领域,提供丰富的图像处理工具,将复杂操作极致简化,真正实现让图像处理更简单。
功能特性 🎯
-
完整的 MVC架构,代码 结构清晰,易于维护,支持
API
和Web界面
- 支持视频文案 AI自动生成,也可以自定义文案
-
支持多种 高清视频 尺寸
-
竖屏 9:16,
1080x1920
-
横屏 16:9,
1920x1080
-
竖屏 9:16,
- 支持 批量视频生成,可以一次生成多个视频,然后选择一个最满意的
- 支持 视频片段时长 设置,方便调节素材切换频率
- 支持 中文 和 英文 视频文案
- 支持 多种语音 合成,可 实时试听 效果
-
支持 字幕生成,可以调整
字体
、位置
、颜色
、大小
,同时支持字幕描边
设置 -
支持 背景音乐,随机或者指定音乐文件,可设置
背景音乐音量
- 视频素材来源 高清,而且 无版权,也可以使用自己的 本地素材
-
支持 OpenAI、Moonshot、Azure、gpt4free、one-api、通义千问、Google Gemini、Ollama、
DeepSeek、 文心一言 等多种模型接入
- 中国用户建议使用 DeepSeek 或 Moonshot 作为大模型提供商(国内可直接访问,不需要VPN。注册就送额度,基本够用)
后期计划 📅
- GPT-SoVITS 配音支持
- 优化语音合成,利用大模型,使其合成的声音,更加自然,情绪更加丰富
- 增加视频转场效果,使其看起来更加的流畅
- 增加更多视频素材来源,优化视频素材和文案的匹配度
- 增加视频长度选项:短、中、长
- 支持更多的语音合成服务商,比如 OpenAI TTS
- 自动上传到YouTube平台
交流讨论 💬
视频演示 📺
竖屏 9:16
|
|
---|
6. JoshuaC215 / Agent-Service-Toolkit
Description: Full toolkit for running an AI agent service built with LangGraph, FastAPI, and Streamlit.
Link to Repository: Visit Repository
JoshuaC215 / agent-service-toolkit
Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit
🧰 AI Agent Service Toolkit
A full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit.
It includes a LangGraph agent, a FastAPI service to serve it, a client to interact with the service, and a Streamlit app that uses the client to provide a chat interface. Data structures and settings are built with Pydantic.
This project offers a template for you to easily build and run your own agents using the LangGraph framework. It demonstrates a complete setup from agent definition to user interface, making it easier to get started with LangGraph-based projects by providing a full, robust toolkit.
🎥 Watch a video walkthrough of the repo and app
Overview
Quickstart
Run directly in python
# At least one LLM API key is required
echo 'OPENAI_API_KEY=your_openai_api_key' >> .env
# uv is recommended but "pip install ." also works
pip
…7. Dnhkng / GLaDOS
Description: Personality Core for GLaDOS, a real-life implementation of the AI from the Portal series by Valve.
Link to Repository: Visit Repository
dnhkng / GLaDOS
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
GLaDOS Personality Core
This is a project dedicated to building a real-life version of GLaDOS!
NEW: If you want to chat or join the community, Join our discord! If you want to support, sponsor the project here!
LocalGLaDOS.mp4
Update 3-1-2025 Got GLaDOS running on an 8Gb SBC!
glados_update.mov
This is really tricky, so only for hardcore geeks! Checkout the 'rock5b' branch, and my OpenAI API for the RK3588 NPU system Don't expect support for this, it's in active development, and requires lots of messing about in armbian linux etc.
Goals
This is a hardware and software project that will create an aware, interactive, and embodied GLaDOS.
This will entail:
- Train GLaDOS voice generator
- Generate a prompt that leads to a realistic "Personality Core"
- Generate a medium- and long-term memory for GLaDOS (Probably a custom vector DB in a simpy Numpy array!)
- Give GLaDOS vision via a VLM (either a full…
8. Canner / WrenAI
Description: 🤖 Open-source GenBI AI Agent for generating Text-to-SQL, charts, reports, and more through interactive data chats.
Link to Repository: Visit Repository
Canner / WrenAI
🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI. 📈📊📋🧑💻
Wren AI
Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, and BI.
🕶 Try it yourself!
GenBI (Generative Business Intelligence)
Wren.BI.Reports.mov
Ask any questions
wren_intro_demo.mp4
👉 Try with your data on Wren AI Cloud or Install in your local environment
Supported LLM Models
Wren AI supports integration with various Large Language Models (LLMs), including but not limited to:
- OpenAI Models
- Azure OpenAI Models
- Google AI Studio – Gemini Models
- Vertex AI Models (Gemini + Anthropic)
- Bedrock Models
- Anthropic API Models
- Groq Models
- Ollama Models
- Databricks Models
Caution
The performance of Wren AI depends significantly on the capabilities of the LLM you choose. We strongly recommend using the most powerful model available for optimal results. Using less capable models may lead to reduced performance, slower response times, or inaccurate outputs.
🎯 Our Vision & Mission
Wren AI’s mission is to…
9. Ton-Blockchain / Ton
Description: Main TON monorepo.
Link to Repository: Visit Repository
ton-blockchain / ton
Main TON monorepo
Main TON monorepo, which includes the code of the node/validator, lite-client, tonlib, FunC compiler, etc.
The Open Network
The Open Network (TON) is a fast, secure, scalable blockchain focused on handling millions of transactions per second (TPS) with the goal of reaching hundreds of millions of blockchain users.
- To learn more about different aspects of TON blockchain and its underlying ecosystem check documentation
- To run node, validator or lite-server check Participate section
- To develop decentralised apps check Tutorials, FunC docs and DApp tutorials
- To work on TON check wallets, explorers, DEXes and utilities
- To interact with TON check APIs
Updates flow
-
master branch - mainnet is running on this stable branch.
Only emergency updates, urgent updates, or updates that do not affect the main codebase (GitHub workflows / docker images / documentation) are committed directly to this branch.
-
testnet branch…
10. Vikhyat / Moondream
Description: Tiny vision language model.
Link to Repository: Visit Repository
🌔 moondream
a tiny vision language model that kicks ass and runs anywhere
Examples
About
Moondream is a highly efficient open-source vision…
Honorable Mentions
Here are a few repositories that didn’t make the top 10 but deserve a mention this week:
- NVlabs / Sana – Efficient high-resolution image synthesis with Linear Diffusion Transformer.
- Fixie-ai / Ultravox – A fast multimodal LLM for real-time voice.
- Unclecode / Crawl4AI – 🚀🤖 Open-source LLM-friendly web crawler and scraper.
- Henrygd / Beszel – Lightweight server monitoring hub with historical data and alerts.
- Mufeedvh / Code2Prompt – CLI tool to convert your codebase into a single LLM prompt.
Conclusion
That concludes our Top 10 Trending GitHub Repositories for the week of January 20, 2025! Start the new year exploring these exciting projects, contribute where you can, and stay tuned for more in the weeks to come.
Personal Recommendation of the week:
(postiz-app)[https://github.com/gitroomhq/postiz-app]
gitroomhq / postiz-app
📨 The ultimate social media scheduling tool, with a bunch of AI 🤖
Your ultimate AI social media scheduling tool
Postiz: An alternative to: Buffer.com, Hypefury, Twitter Hunter, Etc...
Postiz offers everything you need to manage your social media posts,
build an audience, capture leads, and grow your business
Explore the docs »
Register
·
Join Our Discord (devs only)
·
X
·
Gitroom
·
Telegram (Crypto)
hero.1.mp4
✨ Features
Intro
- Schedule all your social media posts (many AI features)
- Measure your work with analytics.
- Collaborate with other team members to exchange or buy posts.
- Invite your team members to collaborate, comment, and schedule posts.
- At the moment there is no difference between the hosted version to the self-hosted version
Tech Stack
- NX (Monorepo)
- NextJS (React)
- NestJS
- Prisma (Default to PostgreSQL)
- Redis (BullMQ)
- Resend (email notifications)
Quick Start
To have the project up and running, please follow the Quick Start Guide
Invest in the Postiz Coin :)
DMsTbeCfX1crgAse5tver98KAMarPWeP3d6U3Gmmpump
License
This repository's source…
If you're looking for a manage version of Postiz you can sign up for the service, that way we help this amazing open source:
Happy hacking!
Working on the audio version
Top comments (0)