EADST

Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Highlight:

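  • Optimal Alpha Scaling: Before quantization, AWQ scales each input channel of the weights by s = s_x^α, where s_x is that channel's average activation magnitude, and searches for the α in [0, 1] that minimizes the output error of the quantized layer.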
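
The alpha search in the highlight above can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the function names `quantize_rtn` and `search_alpha` are hypothetical, the quantizer is a simple per-row asymmetric round-to-nearest (the paper uses group-wise quantization), and the data is synthetic with a few artificially large activation channels.

```python
import numpy as np

def quantize_rtn(w, n_bits=4):
    """Per-output-row asymmetric round-to-nearest fake quantization (simplified)."""
    qmax = 2 ** n_bits - 1
    w_min = w.min(axis=1, keepdims=True)
    w_max = w.max(axis=1, keepdims=True)
    step = np.maximum(w_max - w_min, 1e-8) / qmax
    zero = np.round(-w_min / step)
    q = np.clip(np.round(w / step) + zero, 0, qmax)
    return (q - zero) * step  # dequantized weights

def search_alpha(w, x, n_bits=4, n_grid=20):
    """Grid-search alpha in [0, 1] for the per-input-channel scale s = s_x ** alpha.

    w: (out_features, in_features) weights
    x: (n_samples, in_features) calibration activations
    """
    y_ref = x @ w.T                                  # full-precision output
    s_x = np.maximum(np.abs(x).mean(axis=0), 1e-8)   # average activation magnitude
    best_alpha, best_err, errors = 0.0, np.inf, {}
    for i in range(n_grid + 1):
        alpha = i / n_grid
        s = s_x ** alpha                             # alpha = 0 means no scaling
        w_q = quantize_rtn(w * s, n_bits)            # quantize the scaled weights
        y_q = (x / s) @ w_q.T                        # undo the scale on the input side
        err = float(np.mean((y_ref - y_q) ** 2))
        errors[alpha] = err
        if err < best_err:
            best_alpha, best_err = alpha, err
    return best_alpha, best_err, errors

# Synthetic calibration data (assumption): a few channels with large activations.
rng = np.random.default_rng(0)
x = rng.normal(size=(256, 64))
x[:, :4] *= 20.0
w = rng.normal(size=(32, 64))

best_alpha, best_err, errors = search_alpha(w, x)
print(f"best alpha = {best_alpha:.2f}, mse = {best_err:.6f}, "
      f"unscaled mse = {errors[0.0]:.6f}")
```

Because alpha = 0 is part of the grid and corresponds to plain round-to-nearest, the selected alpha can never do worse than unscaled quantization on the calibration data.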