EADST

Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Highlight:

  • Optimal Alpha Scaling: Focuses on determining the optimal alpha value for scaling weights prior to quantization.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
CTC Disk Hotel DeepStream diffusers LaTeX CC QWEN CSV LoRA scipy SQL uWSGI v0.dev Color 算法题 CUDA TTS torchinfo LLAMA Password Domain Quantize BeautifulSoup v2ray Random Conda HaggingFace tar MD5 HuggingFace GGML VGG-16 音频 CAM Website Bin Permission Jupyter Land Qwen SPIE Sklearn Template Google Bipartite Heatmap 多线程 Logo Animate FP32 ChatGPT Numpy GoogLeNet Miniforge WAN Zip Gemma JSON 净利润 Plate Quantization Crawler 财报 论文 Bitcoin UI IndexTTS2 Pillow 报税 第一性原理 Video Agent UNIX Diagram Anaconda NLP PyCharm Mixtral Qwen2 公式 InvalidArgumentError 递归学习法 Base64 证件照 Algorithm Llama GPT4 NLTK Michelin Hungarian llama.cpp WebCrawler Paper transformers Data CEIR Statistics 多进程 Tiktoken 签证 printf SQLite YOLO ResNet-50 Cloudreve News Knowledge GIT Markdown AI Datetime GPTQ EXCEL RAR Augmentation BTC Pickle CV Nginx 顶会 Qwen2.5 Windows 搞笑 FP64 RGB Github FP16 Review Distillation OCR Tensor Plotly Python TensorFlow Claude Ubuntu FlashAttention Magnet SAM Django Docker Git logger Translation Image2Text FastAPI SVR NameSilo 图形思考法 Paddle Pytorch 论文速读 Streamlit XGBoost Input VPN PyTorch Card Pandas TensorRT Use Vim Linux icon Shortcut TSV Proxy Jetson hf uwsgi 飞书 PIP Ptyhon 继承 API XML LeetCode Clash LLM Search DeepSeek Firewall ONNX 域名 mmap COCO 图标 PDF FP8 OpenAI Attention git tqdm 腾讯云 关于博主 ModelScope 云服务器 Food Hilton Breakpoint Freesound 阿里云 Baidu Bert VSCode CLAP Interview 强化学习 Tracking PDB Excel Vmess OpenCV BF16 Dataset Transformers Web C++ Math Rebuttal git-lfs Safetensors 版权
站点统计

本站现有博文327篇,共被浏览833165

本站已经建立2538天!

热门文章
文章归档
回到顶部