EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with CMake and a modern C++ compiler. Run the following from the root of the llama.cpp source tree:

cmake -B build
cmake --build build --config Release
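
If the configure step fails, the usual cause is a missing tool. The sketch below is one way to check that CMake and a C++ compiler are on your PATH before building. The helper name have_tool is made up for this example and is not part of llama.cpp, and the compiler names checked are only the common ones.

```shell
#!/usr/bin/env bash
# have_tool is a hypothetical helper for this example, not a llama.cpp command.
# It reports whether a program is on PATH and returns non-zero if it is not.
have_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found:   $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# CMake drives both the configure and the build step.
have_tool cmake || true

# Any one C++ compiler is enough; these are just the common names.
have_tool c++ || have_tool g++ || have_tool clang++ || true
```

For a CUDA build, the same helper can be pointed at nvcc, the compiler that ships with the CUDA Toolkit.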

Building with CUDA

If you have an NVIDIA GPU and the CUDA Toolkit installed, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables the CUDA backend, so supported operations run on the GPU

-j 32 runs up to 32 compile jobs in parallel to speed up the build; pick a value close to the number of CPU cores on your machine
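
Hard-coding 32 only makes sense on a machine with about that many cores. As a minimal sketch, the job count can be read from the system instead. The function name detect_jobs and the fallback value of 4 are assumptions made for this example. The script only prints the build command so you can check it before running it.

```shell
#!/usr/bin/env bash
# detect_jobs is a hypothetical helper for this example.
# It prints the CPU core count: nproc on Linux, sysctl on macOS,
# and an arbitrary fallback of 4 if neither is available.
detect_jobs() {
    nproc 2>/dev/null || sysctl -n hw.ncpu 2>/dev/null || echo 4
}

JOBS="$(detect_jobs)"

# Print the resulting build command instead of running it.
echo "cmake --build build --config Release -j ${JOBS}"
```

Using fewer jobs than cores is also reasonable if the build runs out of memory, since CUDA source files can be heavy to compile.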

Reference

Build llama.cpp locally
