EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B


from transformers import AutoModelForCausalLM, AutoTokenizer

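# Local directory (or Hub ID) that holds the model weights and tokenizer files
model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

# torch_dtype="auto" loads the weights in the dtype recorded in the checkpoint;
# device_map="auto" spreads them over the available devices (needs `accelerate`)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)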

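prompt = "Give me a short introduction to large language models."
# DeepSeek's usage notes for the R1 series suggest putting all instructions in
# the user turn; drop the system message if you want to follow that advice.
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
# Render the conversation with the model's chat template and append the
# assistant prefix so that generation starts a new assistant turn
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)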

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
# generate() returns prompt + completion; slice off the prompt tokens
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

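# skip_special_tokens=False keeps special tokens in the decoded text
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0]

# `text` is the full templated prompt, not just the raw question
print("Question: \n", text)
print("Answer: \n", response)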
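
R1-style models usually emit their reasoning before the final answer, so it can be handy to split the decoded `response` into the two parts. The helper below is a minimal sketch, not part of the original demo: `split_reasoning` is a hypothetical name, and it assumes the reasoning is terminated by a literal `</think>` tag in the decoded text. Adjust the tag if your output looks different.

```python
def split_reasoning(response: str, close_tag: str = "</think>") -> tuple[str, str]:
    """Return (reasoning, answer).

    Assumes the reasoning ends with `close_tag`; if the tag is absent,
    the whole response is treated as the answer.
    """
    head, sep, tail = response.partition(close_tag)
    if not sep:
        return "", response.strip()
    # Drop an opening <think> tag if the decoded text contains one
    reasoning = head.replace("<think>", "", 1).strip()
    return reasoning, tail.strip()


# Stand-in string for illustration; in the demo you would pass `response`
sample = "<think>\nThe user wants a short intro.\n</think>\n\nLarge language models are..."
reasoning, answer = split_reasoning(sample)
print("Reasoning:", reasoning)
print("Answer:", answer)
```

Because the demo decodes with `skip_special_tokens=False`, the answer part may still end with an end-of-sequence token; strip it separately if you need clean text.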