EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
C++ git-lfs LoRA JSON CC 顶会 icon torchinfo Jetson Excel 飞书 NameSilo OpenAI BeautifulSoup News Bert SAM Algorithm Bin Website Firewall Nginx Review Web NLTK Search FastAPI Domain llama.cpp Paddle 音频 logger Datetime Food Conda VPN Vmess Markdown Zip Windows 论文 PyCharm 云服务器 Translation PDB VGG-16 git Crawler Qwen2.5 Ptyhon TensorRT Llama CEIR PDF Qwen OCR 公式 TTS WAN uWSGI Paper Land InvalidArgumentError SQLite LaTeX FP32 Git Bipartite Pickle tqdm Knowledge GoogLeNet Tensor RGB Pandas FlashAttention Gemma Freesound Math UI Quantization PIP CSV uwsgi v0.dev printf scipy Cloudreve Python HuggingFace Bitcoin Ubuntu Plotly OpenCV LeetCode 第一性原理 CLAP 多线程 YOLO Interview PyTorch Miniforge IndexTTS2 SPIE WebCrawler Transformers FP8 Github TSV Augmentation FP64 算法题 Google Pillow Claude Logo API Video Template BF16 Rebuttal Statistics Heatmap GIT Numpy Django CAM RAR 多进程 图标 v2ray 域名 Docker SVR DeepSeek Hungarian Image2Text ONNX Distillation BTC transformers 证件照 Baidu Diagram 财报 净利润 阿里云 GPT4 diffusers NLP Agent Data hf QWEN 搞笑 ChatGPT GGML FP16 LLM 报税 Safetensors Random TensorFlow mmap VSCode XGBoost 继承 Password CTC ResNet-50 Qwen2 Proxy AI Mixtral Animate GPTQ Magnet Base64 Card Permission Clash Hotel Linux MD5 SQL COCO 论文速读 递归学习法 EXCEL Dataset CV 腾讯云 Tracking Streamlit Color Jupyter Breakpoint 图形思考法 Plate 强化学习 Use DeepStream CUDA 版权 Sklearn HaggingFace Quantize Input 签证 Tiktoken LLAMA Disk tar UNIX Pytorch Michelin Hilton Vim Shortcut XML 关于博主 ModelScope Anaconda Attention
站点统计

本站现有博文327篇,共被浏览833773

本站已经建立2538天!

热门文章
文章归档
回到顶部