EADST

Sharding and SafeTensors in Hugging Face Transformers

In the Hugging Face transformers library, managing large models efficiently is crucial, especially when working with limited disk space or specific file size requirements. Two key features that help with this are sharding and the use of SafeTensors.

Sharding

Sharding is the process of splitting a large model's weights into smaller files or "shards." This is particularly useful when dealing with large models that exceed file size limits or when you want to manage storage more effectively.

Usage

To shard a model during the saving process, you can use the max_shard_size parameter in the save_pretrained method. Here's an example:

# Save the model with sharding, setting the maximum shard size to 1GB
model.save_pretrained('./model_directory', max_shard_size="1GB")

In this example, the model's weights will be divided into multiple files, each not exceeding 1GB. This can make storage and transfer more manageable, especially when dealing with large-scale models.

SafeTensors

The safetensors library provides a new format for storing tensors in a safe and efficient way. Unlike traditional formats like PyTorch's .pt files, SafeTensors ensures that the tensor data cannot be accidentally executed as code, offering an additional layer of security. This is particularly important when sharing models across different systems or with the community.

Usage

To save a model using SafeTensors, simply specify the safe_serialization parameter when saving:

# Save the model using SafeTensors format
model.save_pretrained('./model_directory', safe_serialization=True)

This will create files with the .safetensors extension, ensuring the saved tensors are stored safely.

Combining Sharding and SafeTensors

You can combine both sharding and SafeTensors to save a large model securely and efficiently:

# Save the model with sharding and SafeTensors
model.save_pretrained('./model_directory', max_shard_size="1GB", safe_serialization=True)

This setup splits the model into shards, each in the SafeTensors format, offering both manageability and security.

Conclusion

By leveraging sharding and SafeTensors, Hugging Face transformers users can handle large models more effectively. Sharding helps manage file sizes, while SafeTensors ensures the safe storage of tensor data. These features are essential for anyone working with large-scale models, providing both practical and security benefits.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Qwen2.5 TensorFlow LLM diffusers GPTQ QWEN 签证 CAM uWSGI OpenAI Numpy 飞书 TSV XML Review ModelScope CEIR C++ YOLO Michelin Streamlit Vmess Bert 算法题 Card Plate 音频 Breakpoint Pillow UNIX FP8 Random LoRA Git Dataset 报税 PDF NameSilo CV Password Domain Linux 顶会 Github Miniforge Quantize Paper TensorRT Qwen2 InvalidArgumentError tar Pandas TTS 递归学习法 图标 Zip Bin 证件照 CC Search icon Vim EXCEL v0.dev PyCharm Pytorch 净利润 AI Attention Paddle Cloudreve GoogLeNet NLP Use torchinfo Algorithm BF16 强化学习 Base64 VPN FP64 Crawler WAN VGG-16 图形思考法 Statistics Translation Math Agent FastAPI News ResNet-50 Transformers Template Firewall ONNX FlashAttention Hotel DeepSeek 域名 第一性原理 logger COCO 多进程 scipy Land Markdown Permission Image2Text RAR Tracking Bitcoin 论文 printf Rebuttal Windows Docker Baidu Nginx uwsgi OCR 关于博主 Proxy Datetime Bipartite Google Logo Diagram Video Conda Knowledge 云服务器 版权 API CLAP SVR Distillation Input Safetensors LeetCode Plotly NLTK Llama Data Pickle LLAMA Claude Clash Interview Jupyter DeepStream Freesound 继承 Excel BTC CUDA Qwen Tiktoken mmap OpenCV Website JSON FP32 HaggingFace transformers git SAM Gemma HuggingFace v2ray Ubuntu 公式 FP16 Disk Magnet CSV BeautifulSoup CTC 阿里云 多线程 Food RGB SPIE Heatmap Hungarian IndexTTS2 git-lfs ChatGPT 腾讯云 Augmentation XGBoost Django UI Python Mixtral 财报 GIT Shortcut Jetson Color PyTorch Animate tqdm PIP WebCrawler hf MD5 Ptyhon llama.cpp SQL 论文速读 LaTeX Hilton Sklearn 搞笑 SQLite PDB Quantization Tensor GPT4 VSCode GGML Anaconda Web
站点统计

本站现有博文327篇,共被浏览833838

本站已经建立2538天!

热门文章
文章归档
回到顶部