Saving a LLaMA Model in INT8 Format

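# Requires the transformers, accelerate, and bitsandbytes packages.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Ask bitsandbytes to quantize the model to INT8 while it is being loaded.
config = BitsAndBytesConfig(
    load_in_8bit=True,
)
path = "/home/llm/model/path/"  # local folder holding the original checkpoint
# Note: bitsandbytes 8-bit loading has generally required a CUDA GPU. If
# device_map="cpu" fails in your environment, device_map="auto" is the usual alternative.
model = AutoModelForCausalLM.from_pretrained(path, device_map="cpu", quantization_config=config)
# Write the quantized weights and the config to a new folder.
model.save_pretrained("model_save_folder-8bit")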
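After saving, it can be useful to check how large the folder is on disk and to confirm that the checkpoint reloads. The sketch below is one way to do that, not part of the original snippet. The helper names `checkpoint_size_gib` and `load_saved_8bit` are hypothetical. It assumes a recent `transformers` with `accelerate` and `bitsandbytes` installed, and that the quantization settings were written into the saved `config.json`, so no `BitsAndBytesConfig` is passed on reload.

```python
import os


def checkpoint_size_gib(folder: str) -> float:
    """Return the total size of all files under `folder`, in GiB."""
    total_bytes = 0
    for root, _dirs, files in os.walk(folder):
        for name in files:
            total_bytes += os.path.getsize(os.path.join(root, name))
    return total_bytes / 1024**3


def load_saved_8bit(folder: str = "model_save_folder-8bit"):
    """Reload the checkpoint written by save_pretrained (hypothetical helper)."""
    # Imported lazily so the size helper works without transformers installed.
    from transformers import AutoModelForCausalLM

    # Assumption: the saved config.json already carries the 8-bit
    # quantization settings, so they are not repeated here.
    return AutoModelForCausalLM.from_pretrained(folder, device_map="auto")


if __name__ == "__main__":
    folder = "model_save_folder-8bit"
    if os.path.isdir(folder):
        print(f"{folder}: {checkpoint_size_gib(folder):.2f} GiB on disk")
        model = load_saved_8bit(folder)
        print(type(model).__name__)
```

As a rough expectation, INT8 weights take about one byte per parameter, so the folder should be noticeably smaller than an FP16 copy of the same model. The exact ratio depends on which layers are left unquantized.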