Hundreds of OpenAI employees threaten to resign and join Microsoft
Catch me if you can! How to beat GPT-4 with a 13B model | LMSYS Org
TensorRT-LLM evaluation of the new H200 GPU achieves 11,819 tokens/s on Llama2-13B
...
ExUI - a lightweight web UI for ExLlamaV2 by turboderp
New "Context Shifting" feature in KoboldCPP 1.48
LangChain Web UI - ollama
Talk with LLaMA AI in your terminal
Why Reddit?
Phind V7 subjectively performing at GPT4 levels for coding
Min P sampler (an alternative to Top K/Top P) has been merged into llama.cpp
HUGE dataset released for open source use
I've started uploading quants of exllama v2 models, taking requests
Nearly 10% of people ask AI chatbots for explicit content. Will it lead LLMs astray?
Text Generation Web-UI has been updated to CUDA 12.1, and with it new Docker images are needed
Single Digit tokenization improves LLM math abilities by up to 70x
Are Local LLMs Useful in Incident Response? - SANS Internet Storm Center
Dolphin 2.0 based on mistral-7b released by Eric Hartford
Beginner questions thread