EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with a modern C++ compiler. Here's how to do it:

cmake -B build
cmake --build build --config Release

Building with CUDA

If you have an NVIDIA GPU, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables CUDA support

-j 32 enables parallel compilation with 32 threads to speed up the build process

Reference

Build llama.cpp locally

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
DeepStream Math tqdm Use Video SQL Freesound NameSilo Agent Paddle Interview SPIE 音频 论文 Quantization 云服务器 LLAMA Distillation Plate Card Transformers Proxy GIT WAN Tracking Qwen2.5 Land 强化学习 TensorRT Food FP8 PyCharm tar Claude mmap torchinfo git-lfs 报税 TensorFlow Attention Rebuttal Numpy NLP QWEN 图标 Magnet Hotel VPN COCO Gemma Hungarian Logo GGML DeepSeek 多进程 证件照 ResNet-50 diffusers Disk EXCEL Dataset BF16 Vim Algorithm Qwen2 TSV InvalidArgumentError Clash Review 第一性原理 Qwen YOLO icon OpenCV AI PDF Heatmap Markdown Streamlit Data CEIR Conda PyTorch 顶会 HaggingFace CSV VSCode Michelin BTC CC 签证 RAR BeautifulSoup 搞笑 logger News CAM Paper Miniforge 递归学习法 Knowledge 图形思考法 算法题 Base64 Vmess RGB 腾讯云 Domain ONNX Linux Hilton Tiktoken Baidu ModelScope CV llama.cpp Diagram Nginx Template Breakpoint FlashAttention scipy Random v0.dev 版权 hf PIP Bipartite SAM UNIX printf NLTK Bert LoRA transformers OCR Mixtral XGBoost Tensor Ubuntu Anaconda Cloudreve v2ray LLM Color Google 域名 MD5 LeetCode IndexTTS2 关于博主 Input Crawler Pytorch Translation GoogLeNet GPTQ UI 公式 财报 Statistics WebCrawler API CUDA Jupyter Pandas VGG-16 多线程 Permission C++ Git FP32 FastAPI Sklearn Safetensors Bin FP16 Datetime Quantize Website Shortcut Pickle JSON Password FP64 Windows TTS Ptyhon Animate CTC SQLite Image2Text Python ChatGPT Web HuggingFace Firewall GPT4 uwsgi git 继承 Excel Django SVR Plotly Bitcoin uWSGI 飞书 OpenAI Jetson XML 净利润 Github Pillow Augmentation Search Zip 论文速读 LaTeX Docker 阿里云 PDB CLAP Llama
站点统计

本站现有博文327篇,共被浏览833144

本站已经建立2538天!

热门文章
文章归档
回到顶部