The llama.cpp web server, `llama-server`, is a lightweight, OpenAI-API-compatible HTTP server that can serve local models and connect them easily to existing clients.
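Because the server speaks the OpenAI chat-completions protocol, any HTTP client can talk to it. Below is a minimal sketch in Python using `requests`; it assumes a server is already running locally (e.g. `llama-server -m model.gguf --port 8080`, where the model path and port are placeholders).

```python
# Query llama-server's OpenAI-compatible chat endpoint.
# Assumes llama-server is listening on localhost:8080 (the default port).
import requests

resp = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "model": "local",  # llama-server serves whatever model it was started with
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Say hello in one sentence."},
        ],
        "temperature": 0.7,
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```

The same endpoint works with off-the-shelf OpenAI client libraries by pointing their base URL at the local server.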
llama-cpp-python provides simple Python bindings for @ggerganov's llama.cpp library. The package offers a high-level Python API for text completion as well as an OpenAI-compatible web server.
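A minimal sketch of the high-level bindings API, assuming the package is installed (`pip install llama-cpp-python`) and a GGUF model file is available locally; the model path below is a placeholder.

```python
# High-level text completion with llama-cpp-python.
from llama_cpp import Llama

# Load a local GGUF model; n_ctx sets the context window size.
llm = Llama(model_path="./models/model.gguf", n_ctx=2048)

# Simple completion; max_tokens bounds the generated length,
# and stop sequences cut generation off early.
out = llm(
    "Q: Name the planets in the solar system. A:",
    max_tokens=64,
    stop=["Q:"],
)
print(out["choices"][0]["text"])
```

The package also ships a server entry point (`python -m llama_cpp.server`) that exposes the same OpenAI-compatible interface as `llama-server`.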
A SYCL-based backend of llama.cpp supports Intel GPUs, including the Data Center Max series, Flex series, Arc series, and integrated GPUs.
The server itself is a fast, lightweight, pure C/C++ HTTP server built on httplib and nlohmann::json on top of llama.cpp. It exposes a set of LLM REST APIs and a simple web front end for interacting with llama.cpp.
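Beyond the OpenAI-compatible routes, the server exposes native endpoints. A short sketch, again assuming a server on localhost:8080; the exact response fields may vary across versions.

```python
# Probe llama-server's native (non-OpenAI) REST endpoints.
import requests

base = "http://localhost:8080"

# Readiness probe: /health reports whether the model has finished loading.
print(requests.get(f"{base}/health", timeout=10).json())

# Native completion endpoint: n_predict caps the number of generated tokens.
r = requests.post(
    f"{base}/completion",
    json={"prompt": "The capital of France is", "n_predict": 16},
    timeout=120,
)
r.raise_for_status()
print(r.json()["content"])
```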
llama.cpp is an open source software library written mostly in C++ that performs inference on various large language models such as Llama. |