东毅居士

Check the Index and Token from Tiktoken

作者：XD / 发表： 2024年2月4日 01:27 / 编程笔记/ 阅读量：2103

Check the Index and Token from Tiktoken

Use md5sum to Verify File Integrity

作者：XD / 发表： 2024年1月31日 21:34 / 编程笔记/ 阅读量：2220

Use md5sum to Verify File Integrity

llama.cpp: Definations of Q2_K, Q3_K, Q4_K, Q5_K, Q6_K, and Q8_K Structures

作者：XD / 发表： 2024年1月25日 01:05 / 编程笔记/ 阅读量：6182

llama.cpp: Definitions of Q2_K, Q3_K, Q4_K, Q5_K, Q6_K, and Q8_K Structures

llama.cpp: Efficient 6-bit Data Packing in an 8-bit Array

作者：XD / 发表： 2024年1月25日 00:39 / 编程笔记/ 阅读量：2255

llama.cpp: Efficient 6-bit Data Packing in an 8-bit Array

Setting Up v2rayNG with Tencent Cloud Silicon Valley Lighthouse

作者：XD / 发表： 2023年12月7日 02:18 / 编程笔记/ 阅读量：2646

Setting Up v2rayNG with Tencent Cloud Silicon Valley Lighthouse 利用腾讯云配置自己的v2ray

Check All Values from One Tensor Equal to One Value

作者：XD / 发表： 2023年11月16日 06:26 / 编程笔记/ 阅读量：1661

Check All Values from One Tensor Equal to One Value

Pytorch GPTQ Dequantizing Function

作者：XD / 发表： 2023年11月13日 21:56 / 编程笔记/ 阅读量：2302

Pytorch GPTQ Dequantizing Function

Pytorch Q4_1 Quantize and Dequantize aligning with llama.cpp

作者：XD / 发表： 2023年11月13日 21:49 / 编程笔记/ 阅读量：2122

Pytorch Q4_1 Quantize and Dequantize aligning with llama.cpp

Pytorch Q4_0 Quantize and Dequantize aligning with llama.cpp

作者：XD / 发表： 2023年11月13日 21:42 / 编程笔记/ 阅读量：2142

Pytorch Q4_0 Quantize and Dequantize aligning with llama.cpp

Check the KB Size of the File or Folder in the Linux

作者：XD / 发表： 2023年11月12日 23:12 / 编程笔记/ 阅读量：1765

Check the KB Size of the File or Folder in the Linux

Qwen-7B-Chat模型结构注释

作者：XD / 发表： 2023年10月31日 23:52 / 编程笔记/ 阅读量：4759

Qwen-7B-Chat模型结构注释

Update Code in Django+Nginx+uwsgi Environment

作者：XD / 发表： 2023年10月26日 02:51 / 编程笔记/ 阅读量：1739

Update Code in Django+Nginx+uwsgi Environment

ImportError: cannot import name 'randn_tensor'

作者：XD / 发表： 2023年10月12日 01:14 / 编程笔记/ 阅读量：3499

from diffusers.utils import randn_tensor ImportError: cannot import name 'randn_tensor' from 'diffusers.utils'

pip install FlashAttention

作者：XD / 发表： 2023年10月11日 04:22 / 编程笔记/ 阅读量：2254

pip install FlashAttention

Change ModelScope Cache Folder

作者：XD / 发表： 2023年9月26日 05:48 / 编程笔记/ 阅读量：3131

Change ModelScope Cache Folder

Download Model or Dataset from ModelScope

作者：XD / 发表： 2023年9月25日 04:04 / 编程笔记/ 阅读量：3002

Download Model or Dataset from ModelScope

原 Check the Index and Token from Tiktoken

作者：XD / 发表： 2024年2月4日 01:27 / 编程笔记/ 阅读量：2103

原 Use md5sum to Verify File Integrity

作者：XD / 发表： 2024年1月31日 21:34 / 编程笔记/ 阅读量：2220

原 llama.cpp: Definations of Q2_K, Q3_K, Q4_K, Q5_K, Q6_K, and Q8_K Structures

作者：XD / 发表： 2024年1月25日 01:05 / 编程笔记/ 阅读量：6182

原 llama.cpp: Efficient 6-bit Data Packing in an 8-bit Array

作者：XD / 发表： 2024年1月25日 00:39 / 编程笔记/ 阅读量：2255

原 Setting Up v2rayNG with Tencent Cloud Silicon Valley Lighthouse

作者：XD / 发表： 2023年12月7日 02:18 / 编程笔记/ 阅读量：2646

原 Check All Values from One Tensor Equal to One Value

作者：XD / 发表： 2023年11月16日 06:26 / 编程笔记/ 阅读量：1661

原 Pytorch GPTQ Dequantizing Function

作者：XD / 发表： 2023年11月13日 21:56 / 编程笔记/ 阅读量：2302

原 Pytorch Q4_1 Quantize and Dequantize aligning with llama.cpp

作者：XD / 发表： 2023年11月13日 21:49 / 编程笔记/ 阅读量：2122

原 Pytorch Q4_0 Quantize and Dequantize aligning with llama.cpp

作者：XD / 发表： 2023年11月13日 21:42 / 编程笔记/ 阅读量：2142

原 Check the KB Size of the File or Folder in the Linux

作者：XD / 发表： 2023年11月12日 23:12 / 编程笔记/ 阅读量：1765

原 Qwen-7B-Chat模型结构注释

作者：XD / 发表： 2023年10月31日 23:52 / 编程笔记/ 阅读量：4759

原 Update Code in Django+Nginx+uwsgi Environment

作者：XD / 发表： 2023年10月26日 02:51 / 编程笔记/ 阅读量：1739

原 ImportError: cannot import name 'randn_tensor'

作者：XD / 发表： 2023年10月12日 01:14 / 编程笔记/ 阅读量：3499

原 pip install FlashAttention

作者：XD / 发表： 2023年10月11日 04:22 / 编程笔记/ 阅读量：2254

原 Change ModelScope Cache Folder

作者：XD / 发表： 2023年9月26日 05:48 / 编程笔记/ 阅读量：3131

原 Download Model or Dataset from ModelScope

作者：XD / 发表： 2023年9月25日 04:04 / 编程笔记/ 阅读量：3002

Check the Index and Token from Tiktoken

Use md5sum to Verify File Integrity

llama.cpp: Definations of Q2_K, Q3_K, Q4_K, Q5_K, Q6_K, and Q8_K Structures

llama.cpp: Efficient 6-bit Data Packing in an 8-bit Array

Setting Up v2rayNG with Tencent Cloud Silicon Valley Lighthouse

Check All Values from One Tensor Equal to One Value

Pytorch GPTQ Dequantizing Function

Pytorch Q4_1 Quantize and Dequantize aligning with llama.cpp

Pytorch Q4_0 Quantize and Dequantize aligning with llama.cpp

Check the KB Size of the File or Folder in the Linux

Qwen-7B-Chat模型结构注释

Update Code in Django+Nginx+uwsgi Environment

ImportError: cannot import name 'randn_tensor'

pip install FlashAttention

Change ModelScope Cache Folder

Download Model or Dataset from ModelScope