EADST

Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Highlight:

  • Optimal Alpha Scaling: Focuses on determining the optimal alpha value for scaling weights prior to quantization.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
WebCrawler Diagram LLAMA PIP Bert Card LaTeX CAM 腾讯云 Bin SQL TSV Safetensors Food Base64 Domain Image2Text Github ResNet-50 ChatGPT 报税 Bipartite 财报 PDF Search Permission RGB CC Plate Clash Statistics UI Claude Git Template Jupyter AI GPTQ llama.cpp NLTK 签证 强化学习 Pandas 关于博主 GPT4 tqdm Gemma API 净利润 PDB 多进程 VSCode 递归学习法 Anaconda 域名 XML diffusers Knowledge Video UNIX MD5 ONNX Excel TTS Qwen Numpy Tiktoken Windows OCR FP64 第一性原理 EXCEL Crawler CEIR NameSilo v2ray FP32 CSV Agent Website hf Docker Land transformers Web VPN 版权 证件照 Freesound Jetson Miniforge Streamlit SPIE Ubuntu Logo BF16 Markdown 飞书 IndexTTS2 Qwen2.5 Bitcoin SAM Vmess NLP Sklearn git-lfs LeetCode VGG-16 Transformers FastAPI C++ PyTorch Rebuttal Django 多线程 Algorithm Interview FP8 OpenAI JSON SQLite Hotel Paper Firewall Plotly Augmentation BTC XGBoost Vim scipy News Attention SVR ModelScope Pillow 图形思考法 Disk Password QWEN printf mmap Dataset Mixtral torchinfo Input Paddle InvalidArgumentError FP16 GoogLeNet Baidu DeepStream Zip Review tar Llama 阿里云 GIT Conda Heatmap Google Color icon Math BeautifulSoup CTC TensorFlow 搞笑 TensorRT PyCharm Cloudreve LoRA Quantize FlashAttention 音频 Ptyhon Magnet Pytorch Qwen2 Random Tensor Pickle Distillation HaggingFace Animate 继承 GGML uwsgi YOLO Nginx OpenCV uWSGI Tracking logger CUDA Breakpoint LLM Hilton v0.dev 图标 WAN 顶会 COCO 算法题 Python HuggingFace git Linux Michelin Quantization CLAP 公式 CV Proxy RAR 云服务器 Use Translation Datetime DeepSeek Hungarian Data Shortcut
站点统计

本站现有博文323篇,共被浏览795379

本站已经建立2493天!

热门文章
文章归档
回到顶部