LocalLLaMA

Members: 3,601
Posts: 329
Active Today: 5
Created: 2 yr. ago

noneabove1182 @sh.itjust.works
2y ago

Hundreds of OpenAI employees threaten to resign and join Microsoft

www.theverge.com/2023/11/20/23968988/openai-employees-resignation-letter-microsoft-sam-altman

8
noneabove1182 @sh.itjust.works
2y ago

Catch me if you can! How to beat GPT-4 with a 13B model | LMSYS Org

lmsys.org/blog/2023-11-14-llm-decontaminator/

0
noneabove1182 @sh.itjust.works
2y ago

TensorRT-LLM evaluation of the new H200 GPU achieves 11,819 tokens/s on Llama2-13B

github.com/NVIDIA/TensorRT-LLM/blob/release/0.5.0/docs/source/blogs/H200launch.md

0
CoderSupreme @programming.dev
2y ago

...

0
noneabove1182 @sh.itjust.works
2y ago

ExUI - a lightweight web UI for ExLlamaV2 by turboderp

github.com/turboderp/exui

0
CoderSupreme @programming.dev
2y ago

...

2
rufus @discuss.tchncs.de
2y ago

New "Context Shifting" feature in KoboldCPP 1.48

github.com/LostRuins/koboldcpp/releases/tag/v1.48

2
suoko @feddit.it
2y ago

LangChain Web UI - ollama

github.com/suoko/ollama/blob/main/examples/langchain-python-langchain-webui/readme.md

0
Even_Adder @lemmy.dbzer0.com
2y ago

Talk with LLaMA AI in your terminal

github.com/ggerganov/whisper.cpp/tree/master/examples/talk-llama

0
acec @lemmy.world
2y ago

why reddit?

13
noneabove1182 @sh.itjust.works
2y ago

Phind V7 subjectively performing at GPT4 levels for coding

news.ycombinator.com/item

10
noneabove1182 @sh.itjust.works
2y ago

Min P sampler (an alternative to Top K/Top P) has been merged into llama.cpp

github.com/ggerganov/llama.cpp/pull/3841

0
noneabove1182 @sh.itjust.works
2y ago

HUGE dataset released for open source use

together.ai/blog/redpajama-data-v2

4
noneabove1182 @sh.itjust.works
2y ago

I've started uploading quants of exllama v2 models, taking requests

huggingface.co/bartowski

0
rufus @discuss.tchncs.de
2y ago

Article from October 3

Nearly 10% of people ask AI chatbots for explicit content. Will it lead LLMs astray?

www.zdnet.com/article/nearly-10-of-people-ask-ai-chatbots-for-explicit-content-will-it-lead-llms-astray/

18
noneabove1182 @sh.itjust.works
2y ago

Text Generation Web-UI has been updated to CUDA 12.1, and with it new docker images are needed

0
noneabove1182 @sh.itjust.works
2y ago

Single Digit tokenization improves LLM math abilities by up to 70x

twitter.com/andrew_n_carr/status/1714326003030638848

2
ylai @lemmy.ml
2y ago

Are Local LLMs Useful in Incident Response? - SANS Internet Storm Center

isc.sans.edu/diary/rss/30274

0
noneabove1182 @sh.itjust.works
2y ago

Dolphin 2.0 based on mistral-7b released by Eric Hartford

huggingface.co/ehartford/dolphin-2.0-mistral-7b

1
noneabove1182 @sh.itjust.works
2y ago

Beginner questions thread

28