PyTorch: Freeze Layers to Fine-tune the Model

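# `model` is assumed to be an existing torch.nn.Module.
# Step 1: print every parameter name to find the layers you want to train.
for k, v in model.named_parameters():
    print(k)

# Step 2: keep only the listed parameters trainable and freeze everything else.
for k, v in model.named_parameters():
    if k in ["last.weight", "last.bias"]:  # names of the layers to fine-tune
        v.requires_grad = True
    else:
        v.requires_grad = False  # frozen: no gradients are computed for this parameter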
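
To see the effect end to end, here is a minimal sketch on a toy model. The `TinyNet` class, its `body` layer, the layer sizes, and the SGD settings are all made up for illustration; only the `last.weight` / `last.bias` names come from the snippet above. It freezes everything except `last`, hands only the trainable parameters to the optimizer, and runs one update step.

```python
import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical two-layer model; `last` is the layer we fine-tune."""

    def __init__(self):
        super().__init__()
        self.body = nn.Linear(8, 4)
        self.last = nn.Linear(4, 2)

    def forward(self, x):
        return self.last(torch.relu(self.body(x)))


torch.manual_seed(0)
model = TinyNet()

# Keep only the listed parameters trainable; freeze the rest.
trainable = {"last.weight", "last.bias"}
for name, param in model.named_parameters():
    param.requires_grad = name in trainable

# Pass only the trainable parameters to the optimizer (assumed SGD, lr=0.1).
optimizer = torch.optim.SGD(
    [p for p in model.parameters() if p.requires_grad], lr=0.1
)

# Snapshot a frozen weight so we can compare it after the update.
body_before = model.body.weight.detach().clone()

# One training step on random data with a dummy loss.
loss = model(torch.randn(16, 8)).pow(2).mean()
optimizer.zero_grad()
loss.backward()
optimizer.step()

body_unchanged = torch.equal(model.body.weight, body_before)
trainable_names = [n for n, p in model.named_parameters() if p.requires_grad]
print(trainable_names)
print("frozen layer unchanged:", body_unchanged)
```

Filtering the parameters passed to the optimizer is optional, since frozen parameters never receive gradients, but it keeps the optimizer state smaller and makes the intent explicit.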