EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language model."
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0] # show special tokens

print("Question: \n", text)
print("Answer: \n", response)
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Zip OpenAI Bipartite Magnet Permission Vmess Nginx 继承 BeautifulSoup Use Docker 域名 Crawler Paper FP32 Agent FP8 Tensor uwsgi Review 算法题 Web SQL CLAP torchinfo Paddle Qwen2.5 AI Baidu Hungarian PDF TSV Github Windows Jetson Interview QWEN NLTK Heatmap Excel Freesound BTC 第一性原理 论文 财报 Disk 搞笑 顶会 Jupyter WAN 音频 UI TTS printf VGG-16 VPN llama.cpp Firewall VSCode DeepStream Streamlit Qwen Data Cloudreve uWSGI Markdown LeetCode icon Bert Linux LoRA 图形思考法 Pillow Breakpoint C++ 图标 Random Input InvalidArgumentError CUDA diffusers Python Logo Tracking CSV 净利润 FP64 LaTeX 报税 Distillation Base64 Shortcut Dataset YOLO JSON LLAMA Ubuntu FP16 GIT EXCEL Hotel CEIR Ptyhon FlashAttention Anaconda git-lfs Color Animate SPIE Quantization 多进程 飞书 Diagram Qwen2 Transformers 多线程 Attention ModelScope Safetensors XML PyTorch Card UNIX SQLite Numpy Google Plate 版权 RGB MD5 PIP Tiktoken XGBoost Bin TensorRT HuggingFace Knowledge v2ray git TensorFlow Pandas Domain v0.dev BF16 Augmentation Algorithm Vim PDB mmap Statistics Pytorch RAR IndexTTS2 Llama Michelin Mixtral SAM Claude Clash 强化学习 云服务器 NLP Datetime FastAPI 腾讯云 ONNX ChatGPT API OCR 论文速读 COCO CTC ResNet-50 Food tar Django CC Git Translation CAM hf NameSilo Sklearn Miniforge Plotly Conda tqdm Password PyCharm scipy GGML HaggingFace SVR GoogLeNet Template Quantize CV Gemma Rebuttal 签证 递归学习法 transformers Proxy Pickle WebCrawler Land GPT4 Bitcoin logger 关于博主 GPTQ Hilton Video 阿里云 Search 公式 Image2Text LLM DeepSeek OpenCV 证件照 Website News Math
站点统计

本站现有博文327篇,共被浏览833547

本站已经建立2538天!

热门文章
文章归档
回到顶部