EADST

FP8位数解析

在 AI 模型越来越庞大的今天,我们面临的不仅是算力挑战,更有带宽、能耗和模型部署的瓶颈。正因如此,更高效的数值表示方式成为突破口,其中最受关注的就是 FP8(8位浮点数)格式。

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
About Me
XD
Goals determine what you are going to be.
Category
标签云
Template Tensor Random Bitcoin hf Gemma Nginx PDB Knowledge Domain Shortcut Logo Baidu 腾讯云 COCO LoRA 强化学习 Plotly 继承 财报 Vim tar 云服务器 v2ray Clash Statistics printf scipy SPIE HuggingFace tqdm Michelin Rebuttal GPT4 FlashAttention 多进程 关于博主 Diagram Animate 公式 CC Website DeepSeek ModelScope VPN Jetson TTS Heatmap Attention LaTeX CSV NLTK Tiktoken 图形思考法 Interview OCR Conda 图标 Quantization Review 多线程 XML git-lfs CAM PyCharm PIP Linux GoogLeNet torchinfo JSON NameSilo Ptyhon git diffusers WAN 搞笑 Input Paper YOLO FP64 Algorithm TensorRT Video Hilton Datetime XGBoost Web Mixtral Dataset RAR Image2Text Hotel Paddle CV EXCEL 签证 llama.cpp SAM InvalidArgumentError Github FP8 AI OpenCV Hungarian HaggingFace Transformers Card 第一性原理 Distillation 飞书 论文 Qwen Plate Magnet BF16 ms-swift Qwen2.5 TSV logger 顶会 Disk UI 递归学习法 Use FP16 Miniforge 证件照 净利润 版权 Bin Pillow FP32 Jupyter Vmess 音频 Git GGML Translation Tracking CUDA Agent Password GPTQ GIT Math Google Windows Django Bert BeautifulSoup MD5 transformers RGB Firewall Zip UNIX Markdown Augmentation LeetCode C++ Cloudreve Numpy BTC ONNX CTC IndexTTS2 VGG-16 ResNet-50 SVR 论文速读 API WebCrawler DeepStream Pandas FastAPI Bipartite Breakpoint Llama PyTorch Search v0.dev Python Ubuntu 报税 Land Permission Anaconda NLP Color Docker Data ChatGPT OpenAI mmap Crawler Freesound CEIR SQLite 域名 CLAP Food News Claude TensorFlow Streamlit uwsgi Qwen2 阿里云 Safetensors icon VSCode Base64 Pickle QWEN LLM 算法题 Pytorch PDF Excel SQL LLAMA Quantize Sklearn Proxy uWSGI
站点统计

本站现有博文329篇,共被浏览859513

本站已经建立2568天!

热门文章
文章归档
回到顶部