EADST

FP8位数解析

在 AI 模型越来越庞大的今天,我们面临的不仅是算力挑战,更有带宽、能耗和模型部署的瓶颈。正因如此,更高效的数值表示方式成为突破口,其中最受关注的就是 FP8(8位浮点数)格式。

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
About Me
XD
Goals determine what you are going to be.
Category
标签云
WAN PyCharm CLAP Crawler transformers CTC MD5 IndexTTS2 Vim Permission OpenAI Git Agent Nginx Vmess Django Quantize Freesound 报税 Pickle News UNIX 版权 FP64 torchinfo Translation Math WebCrawler 顶会 v2ray Heatmap 证件照 ONNX 多进程 C++ GPTQ FP32 算法题 Bert hf GPT4 Transformers Image2Text git Base64 PDF Safetensors Qwen2 Password CC LeetCode 财报 PyTorch Streamlit HaggingFace 腾讯云 Sklearn Distillation Plotly Paddle Interview Breakpoint Statistics NLP Gemma Jetson 飞书 LaTeX Domain Bipartite CSV Website 图标 FlashAttention Random Zip tar Anaconda 继承 AI Markdown BTC BeautifulSoup ms-swift Baidu FP8 SVR Search 递归学习法 Hotel RGB llama.cpp Docker Animate DeepStream QWEN SPIE Pillow HuggingFace 签证 PIP diffusers 搞笑 LoRA GGML icon GIT Augmentation Claude 第一性原理 ResNet-50 Plate 音频 Diagram 强化学习 Miniforge Pandas LLAMA 净利润 Logo Attention TSV Pytorch Shortcut Github Ptyhon Proxy Template EXCEL API Numpy Bitcoin Ubuntu CAM Jupyter Conda Data SAM 阿里云 Cloudreve NLTK uWSGI Linux Firewall DeepSeek ModelScope 论文 TensorRT Clash SQL YOLO logger mmap OpenCV ChatGPT VGG-16 Algorithm Video Python TensorFlow OCR UI Quantization CEIR v0.dev 云服务器 Mixtral FastAPI Excel Web GoogLeNet Color 论文速读 scipy CUDA JSON LLM Card 多线程 图形思考法 Google RAR tqdm XML printf Land TTS Tracking Food Windows Llama Qwen FP16 Michelin PDB Review Rebuttal Magnet Bin Disk Hungarian BF16 Datetime Qwen2.5 Tensor 关于博主 CV SQLite VSCode git-lfs Tiktoken 域名 公式 uwsgi Knowledge COCO Hilton Dataset InvalidArgumentError NameSilo XGBoost Paper Input VPN Use
站点统计

本站现有博文329篇,共被浏览859553

本站已经建立2568天!

热门文章
文章归档
回到顶部