EADST

Check the Index and Token from Tiktoken

Check the Index and Token from Tiktoken

import base64
path = "/home/your_dict_path.tiktoken"
f = open(path, "rb").read()
index = 0
for line in f.splitlines():
    l = line.split()
    print("index: ", l[1])
    print("encode: ", l[0])
    print("decode: ", base64.b64decode(l[0]))
    index += 1
    if index > 20:
        break

Reference Code

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Qwen2 OpenCV CC MD5 Review WAN Freesound Dataset Proxy Shortcut News Transformers NameSilo TSV 阿里云 Python tqdm PIP GIT SAM diffusers Pickle Numpy Anaconda Bin PDF 搞笑 OpenAI v2ray PyCharm 签证 Vmess Jupyter CEIR ResNet-50 Web BTC LLM Docker Tensor Color Domain hf Use 云服务器 XML 公式 FP64 Tracking Claude 净利润 Knowledge BeautifulSoup Streamlit API Paper Gemma Qwen Agent SQLite ChatGPT git Bipartite 报税 Mixtral Search NLP Permission DeepSeek Crawler Michelin RGB PyTorch Password 多线程 Git DeepStream Video Quantization 版权 Quantize 顶会 GGML 证件照 Math VPN Excel Translation Jetson WebCrawler 第一性原理 JSON Base64 Cloudreve llama.cpp 财报 Miniforge 图形思考法 logger InvalidArgumentError CV GPT4 Hungarian RAR Website printf FP16 AI Animate Template LaTeX Ptyhon 多进程 OCR TensorRT uwsgi Statistics Bitcoin UNIX Interview FastAPI Diagram Conda Vim Distillation ModelScope Pillow uWSGI CLAP Django Windows Pandas HuggingFace CUDA git-lfs Google TTS VSCode TensorFlow 飞书 YOLO Data XGBoost Pytorch 腾讯云 Qwen2.5 EXCEL 图标 Random Image2Text LLAMA Baidu Sklearn Github SVR CAM 算法题 FlashAttention NLTK tar Nginx BF16 Augmentation Firewall Food Land GPTQ Linux 关于博主 scipy LoRA Tiktoken PDB Datetime 递归学习法 UI Attention 域名 Ubuntu Magnet HaggingFace Llama 强化学习 CSV 音频 Markdown torchinfo ONNX QWEN Algorithm 继承 Plate CTC v0.dev icon Card Rebuttal Clash COCO GoogLeNet Logo FP32 SQL Disk Heatmap Paddle LeetCode Hotel Hilton C++ Plotly FP8 mmap Input Zip SPIE transformers VGG-16 Breakpoint Safetensors IndexTTS2 Bert
站点统计

本站现有博文323篇,共被浏览796019

本站已经建立2493天!

热门文章
文章归档
回到顶部