EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method that extracts useful information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part isolates the effective receipt area: after removing noise and extracting the gradient of the receipt image, we determine a threshold to crop and reshape the useful region. The second part detects text in the receipt image; we modify and deploy the connectionist text proposal network (CTPN) text detection algorithm to locate the text regions in the receipt. In the third part, we adopt connectionist temporal classification (CTC) with maximum entropy regularization as the loss function for training a convolutional recurrent neural network (CRNN) that recognizes the detected text areas, converting the receipt from an image into text. With this method, the useful information in a receipt can be integrated and utilized. We train and test our system on the data set published for Scanned Receipts Optical Character Recognition and Information Extraction (SROIE). The results show that our recognition system identifies receipt information quickly and accurately.
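The first stage described in the abstract (gradient extraction followed by a threshold-based crop) could be sketched roughly as follows. This is only an illustration under assumptions, not the authors' implementation: the function name `crop_receipt`, the `rel_threshold` parameter, and the rule of keeping rows and columns whose summed gradient magnitude exceeds a fraction of the maximum are all hypothetical, and the noise-removal step is omitted.

```python
import numpy as np


def crop_receipt(gray, rel_threshold=0.2):
    """Crop a grayscale image to the bounding box of its strong-gradient rows and columns.

    Hypothetical helper: the thresholding rule is an assumption made for
    illustration, not the rule used in the paper.
    """
    gray = np.asarray(gray, dtype=float)

    # Finite-difference gradients along rows (gy) and columns (gx).
    gy, gx = np.gradient(gray)
    magnitude = np.hypot(gx, gy)

    # Total gradient energy per row and per column.
    row_energy = magnitude.sum(axis=1)
    col_energy = magnitude.sum(axis=0)

    # Assumed rule: keep rows/columns above a fraction of the peak energy.
    rows = np.flatnonzero(row_energy > rel_threshold * row_energy.max())
    cols = np.flatnonzero(col_energy > rel_threshold * col_energy.max())

    # A flat image has no gradient anywhere; return it unchanged.
    if rows.size == 0 or cols.size == 0:
        return gray

    return gray[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]
```

On a real scan, the image would normally be denoised first, and the threshold would need tuning per data set; the fixed fraction here is only a placeholder.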

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.
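The abstract does not say how the masking algorithm identifies overlays. One simple heuristic, swapped in here purely for illustration and not taken from the paper, is that graphical overlays such as scoreboards stay nearly constant while the game footage changes, so pixels with very low variance over time are overlay candidates. The names `static_overlay_mask`, `apply_mask`, and `var_threshold` are hypothetical.

```python
import numpy as np


def static_overlay_mask(frames, var_threshold=1e-3):
    """Return a boolean (H, W) mask of pixels that barely change over time.

    Assumption for illustration: overlays are static, game content is not.
    `frames` is a stack of grayscale frames with shape (T, H, W).
    """
    frames = np.asarray(frames, dtype=float)
    if frames.ndim != 3:
        raise ValueError("expected frames with shape (T, H, W)")

    # Per-pixel variance along the time axis; low variance means static.
    return frames.var(axis=0) < var_threshold


def apply_mask(frame, mask, fill=0.0):
    """Return a copy of `frame` with the masked pixels replaced by `fill`."""
    out = np.array(frame, dtype=float, copy=True)
    out[mask] = fill
    return out
```

A mask built this way could be applied to each frame before tracking or text recognition. A static camera would defeat the heuristic, since the background would also have low variance, so it should be read as a starting point only.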

Paper Download
