EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part provides effective areas for receipt detection. By removing noise and extracting the gradient of the receipt image, we determine the threshold to crop and reshape the useful receipt area. Detecting text from a receipt image is the second part, we modify and deploy the text detection algorithm connectionist text proposal network (CTPN) to locate the text region in the receipt. In the third part, we import the connectionist temporal classification with maximum entropy regularization as the loss function for updating the convolutional recurrent neural networks (CRNN) to recognize the text detection area, which converts the receipt from an image into the text. Based on our method, the effective information of a receipt can be integrated and utilized. We train and test our system using the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results illustrate that our recognition system is able to identify receipt information quickly and accurately.

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

Paper Download

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
HaggingFace GGML 多线程 SAM Data Michelin CLAP Google v2ray Tracking LeetCode CAM CTC 图形思考法 SQLite icon Miniforge 财报 HuggingFace Bipartite WAN 阿里云 Breakpoint InvalidArgumentError WebCrawler Python DeepStream Dataset diffusers PDB Template FlashAttention Github 版权 GPTQ Website Pytorch Heatmap Cloudreve NLTK Baidu VSCode Django Qwen2 AI tar Git GoogLeNet Shortcut Logo PIP FP8 GPT4 torchinfo Bert Windows Conda BF16 Animate 顶会 NameSilo 签证 JSON PyCharm mmap TensorFlow TensorRT Pillow Linux Domain Vmess COCO Quantize Web 论文速读 Pickle Hungarian Docker Diagram Jetson MD5 UI Bin Input Datetime LLM TTS VGG-16 Plotly tqdm Tensor API Ubuntu Statistics RAR Paddle XGBoost uwsgi 飞书 PyTorch 音频 CUDA Rebuttal 图标 第一性原理 Claude 证件照 NLP BeautifulSoup Math Plate 云服务器 Qwen2.5 Tiktoken Numpy Pandas Base64 ResNet-50 Land 论文 Crawler 算法题 BTC Sklearn Card Random Magnet DeepSeek 域名 RGB Jupyter Use QWEN Knowledge 搞笑 Excel uWSGI Hotel FP64 ChatGPT Clash Proxy git Agent Nginx Image2Text OpenCV Interview Color 净利润 PDF Review Gemma Zip 腾讯云 YOLO scipy News SQL Safetensors Bitcoin CEIR git-lfs IndexTTS2 Quantization Markdown GIT Paper 强化学习 Food Augmentation C++ CC Distillation v0.dev CV Video Firewall Qwen EXCEL ONNX printf LoRA Llama 继承 UNIX OCR Anaconda Translation Streamlit Attention 公式 Permission 多进程 logger 递归学习法 Disk FastAPI TSV Hilton transformers Algorithm llama.cpp Search Mixtral LLAMA ModelScope Transformers SVR FP32 XML LaTeX 报税 Ptyhon CSV OpenAI Freesound 关于博主 hf Vim SPIE Password VPN FP16
站点统计

本站现有博文327篇,共被浏览832975

本站已经建立2538天!

热门文章
文章归档
回到顶部