Text and Face Detection, Web Automation, and Image Enhancement | PKS
This application uses YOLOv7 and PaddleOCR to detect text and faces on ID cards for use in registration or verification processes. Additionally, it includes image enhancement capabilities and web automation features to facilitate registration on the PKS website, providing a streamlined experience for users.
In Development

PROJECT OBJECTIVES
- Automatically detect and extract text and faces from ID cards using YOLOv7 and PaddleOCR for seamless data processing.
- Enhance image quality to ensure accurate data extraction from ID card photos.
- Automate form-filling processes on the PKS registration website, reducing manual work and improving efficiency.


DESIGN PROCESS
- Project Ideation and Problem Identification: Identify the need for a system that automates data extraction from ID cards and reduces manual processes during registration or verification on the PKS website.
- Research and Requirement Analysis: Determine suitable technologies for image processing and web automation, focusing on YOLOv7 for face detection, PaddleOCR for text recognition, and image enhancement methods for improved accuracy.
- Application Development: Develop the application with integration of text and face detection models, image enhancement algorithms, and automated form-filling functionality.
- Testing and QA: Conduct extensive testing to ensure accurate data detection, effective image enhancement, and smooth web automation without errors.
CHALLENGES
- Implementing accurate and efficient face and text detection models that work across varied image qualities of ID cards.
- Ensuring stable web automation that can handle various scenarios and forms on the PKS registration site.






LESSONS LEARNED
- Enhanced understanding of YOLOv7 and PaddleOCR for complex image processing and data extraction tasks.
- Learning and implementing effective image enhancement techniques for improved accuracy in OCR and face detection.
- Mastering web automation techniques for applications in repetitive, form-based workflows.