EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    Git Linux WAN Bitcoin CV WebCrawler Tiktoken GPT4 Conda hf Pytorch RGB uwsgi Windows Color XGBoost Augmentation logger Markdown Hungarian Quantize CTC Distillation BTC BeautifulSoup LoRA TSV Rebuttal Hilton 腾讯云 Data Anaconda Password Statistics mmap diffusers TensorFlow VPN OCR 图形思考法 HaggingFace Bipartite Base64 音频 tar 版权 Qwen2.5 Pandas XML Web FP32 第一性原理 Baidu Freesound Input 论文 Miniforge Pickle FP16 PDB Attention Crawler SQLite LLM 论文速读 Jetson Diagram Card Streamlit RAR 域名 Land DeepStream Clash Tracking UNIX Quantization ResNet-50 财报 Video Math Hotel v0.dev 签证 Vim PDF Food Knowledge COCO TTS CLAP Excel Django scipy VGG-16 TensorRT Heatmap OpenAI 证件照 Random AI Vmess Domain Safetensors NLTK Website transformers Shortcut Llama 净利润 Paddle Breakpoint printf Animate 多线程 YOLO Search FlashAttention Github Magnet Template ChatGPT uWSGI SVR API git-lfs News HuggingFace FastAPI SQL 递归学习法 Bert 阿里云 ONNX Dataset 云服务器 公式 Bin CUDA Plotly 关于博主 CC 强化学习 InvalidArgumentError 图标 Gemma Sklearn FP8 搞笑 CAM LaTeX Python Qwen PyCharm CEIR DeepSeek MD5 llama.cpp CSV Mixtral QWEN GoogLeNet Pillow EXCEL ModelScope Permission JSON UI Image2Text Cloudreve 多进程 Numpy Datetime VSCode Claude PIP Algorithm torchinfo NameSilo git Zip Proxy Ptyhon Use LLAMA Transformers GGML Paper Ubuntu Google GIT 报税 Michelin Jupyter 算法题 Translation OpenCV FP64 C++ Review Plate ms-swift Nginx icon GPTQ Agent LeetCode Disk Qwen2 顶会 IndexTTS2 Firewall PyTorch Tensor Interview 飞书 BF16 继承 Logo SAM v2ray tqdm Docker SPIE NLP
    站点统计

    本站现有博文329篇,共被浏览859522

    本站已经建立2568天!

    热门文章
    文章归档
    回到顶部