ggml
2 articles tagged with this topic
Microsoft · VibeVoice
Microsoft VibeVoice Runs Without Python — AI De-Pythonization Hits Speech
Microsoft VibeVoice ported to pure C++ — no Python for inference. AI's de-Pythonization trend expands from text to voice, lowering enterprise voice AI
May 5 · 2 min read
ggml · llama.cpp
GGML Adds Q1_0 1-Bit Quantization: Run 8B Models at 1.15GB
GGML now supports Q1_0 1-bit quantization, shrinking Bonsai 8B models to 1.15GB for CPU-only inference.
Apr 6 · 2 min read