Evolving New Foundation Models: Unleashing the Power of Automating Model Development
GaLore: Advancing Large Model Training on Consumer-grade Hardware
Mistral 7B v0.2 Base (released at SHACK15sf hackathon)
Ollama now supports AMD graphics cards
T-Ragx - Enhancing Translation with RAG-Powered LLMs
My personal collection of interesting models I've quantized from the past week (yes, just week)
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Gemma 2B vs Phi-2
NVIDIA Chat With RTX
Meet ‘Smaug-72B’: The new king of open-source AI
itsme2417/PolyMind: A multimodal, function calling powered LLM webui.
Introducing Nomic Embed: A Truly Open Embedding Model
Uncensored Mixtral 8x7B with 4 GB of VRAM
Mistral CEO confirms ‘leak’ of new open source AI model nearing GPT-4 performance
Meta releases ‘Code Llama 70B’, an open-source behemoth to rival private AI development
A question about running LLMs with an AMD card
Noob here, what's the best overall model for getting started with ?
Zuckerberg wants to build artificial general intelligence with 350K Nvidia H100 GPUs
InternLM2 models llama-fied
Building a fully local LLM voice assistant to control my smart home