Image-based Entity Value Extraction

Project Overview

This project extracts entity values (such as weight, volume, and dimensions) from product images. It combines Optical Character Recognition (OCR) with a Convolutional Neural Network (CNN) so that the model can use both the textual and the visual information in each image.
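To make "entity value" concrete, here is a minimal sketch of turning OCR text into (number, unit) pairs. It is a plain regex illustration of the target output, not the project's trained model. The unit table, the pattern, and the function name `find_entity_values` are all assumptions made for this example.

```python
import re

# Hypothetical unit table; the project's real unit vocabulary is not listed in this README.
UNIT_ALIASES = {
    "kg": "kilogram", "g": "gram",
    "ml": "millilitre", "l": "litre",
    "cm": "centimetre", "mm": "millimetre",
}

# A number (optionally with decimals), optional whitespace, then a known unit.
PATTERN = re.compile(r"(\d+(?:\.\d+)?)\s*(kg|g|ml|l|cm|mm)\b", re.IGNORECASE)


def find_entity_values(ocr_text):
    """Return a list of (value, unit_name) pairs found in OCR text."""
    results = []
    for number, unit in PATTERN.findall(ocr_text):
        results.append((float(number), UNIT_ALIASES[unit.lower()]))
    return results


print(find_entity_values("Net Wt. 500 g  |  Volume: 1.5 L"))
# → [(500.0, 'gram'), (1.5, 'litre')]
```

A rule like this breaks down on noisy OCR output and on values that appear only visually, which is presumably why the project pairs OCR with CNN features.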

Setup and Installation

Clone the repository
Install dependencies: pip install -r requirements.txt
Download and preprocess the dataset using data_preparation.py

Usage

Prepare the data: python data_preparation.py
Extract features: python feature_extraction.py
Train the model: python model_training.py
Generate predictions: python predict.py
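
The four steps above can also be chained from a single script. The sketch below is an assumption, not a file shipped with the project. It uses the script names exactly as written in the usage steps and stops at the first step that fails. The names `PIPELINE_STEPS`, `build_commands`, and `run_pipeline` are hypothetical.

```python
import subprocess
import sys

# Script names copied from the usage steps; adjust them if the files are named differently.
PIPELINE_STEPS = [
    "data_preparation.py",
    "feature_extraction.py",
    "model_training.py",
    "predict.py",
]


def build_commands(python_executable=sys.executable):
    """Return one command (as an argument list) per pipeline step, in order."""
    return [[python_executable, script] for script in PIPELINE_STEPS]


def run_pipeline():
    """Run each step in order; check=True raises CalledProcessError on a non-zero exit."""
    for command in build_commands():
        print("Running:", " ".join(command))
        subprocess.run(command, check=True)
```

Call `run_pipeline()` from the repository root so that the relative script paths resolve.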

Model Architecture

The model uses a hybrid architecture:

OCR branch: Embedding layer followed by LSTM
CNN branch: Pre-extracted features processed by fully connected layers
Combined output: Concatenated features passed through fully connected layers
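
The three parts listed above can be sketched as follows. This README does not name the deep learning framework or the layer sizes, so the use of PyTorch, the class name `HybridEntityModel`, and every dimension below are assumptions chosen for illustration.

```python
import torch
import torch.nn as nn


class HybridEntityModel(nn.Module):
    """Two-branch sketch: LSTM over OCR tokens plus dense layers over CNN features."""

    def __init__(self, vocab_size=5000, embed_dim=64, lstm_hidden=128,
                 cnn_feat_dim=2048, num_classes=10):  # all sizes are assumed
        super().__init__()
        # OCR branch: token ids -> embeddings -> LSTM
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(embed_dim, lstm_hidden, batch_first=True)
        # CNN branch: pre-extracted image features -> fully connected layer
        self.cnn_fc = nn.Sequential(
            nn.Linear(cnn_feat_dim, 256), nn.ReLU(), nn.Dropout(0.3),
        )
        # Combined head: concatenated branch outputs -> fully connected layers
        self.head = nn.Sequential(
            nn.Linear(lstm_hidden + 256, 128), nn.ReLU(),
            nn.Linear(128, num_classes),
        )

    def forward(self, token_ids, cnn_features):
        embedded = self.embedding(token_ids)        # (batch, seq_len, embed_dim)
        _, (h_n, _) = self.lstm(embedded)           # h_n: (1, batch, lstm_hidden)
        ocr_vec = h_n[-1]                           # (batch, lstm_hidden)
        img_vec = self.cnn_fc(cnn_features)         # (batch, 256)
        combined = torch.cat([ocr_vec, img_vec], dim=1)
        return self.head(combined)                  # (batch, num_classes)


# Shape check with random inputs: 4 samples, 20 OCR tokens each.
model = HybridEntityModel()
model.eval()
tokens = torch.randint(1, 5000, (4, 20))
feats = torch.randn(4, 2048)
with torch.no_grad():
    logits = model(tokens, feats)
print(logits.shape)  # → torch.Size([4, 10])
```

The sketch takes the last LSTM hidden state as the summary of the OCR text. Pooling over all time steps would be an equally plausible choice.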

Performance

Validation Accuracy: 87%
F1 Score: 0.85
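
For reference, the sketch below shows how accuracy and an F1 score can be computed from true and predicted labels. The README does not say which F1 averaging was used, so macro averaging here is an assumption. The labels are toy values made up for the example and are unrelated to the figures reported above.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (macro averaging, assumed here)."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        scores.append(2 * precision * recall / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy labels for illustration only.
y_true = ["gram", "gram", "litre", "cm"]
y_pred = ["gram", "litre", "litre", "cm"]
print(accuracy(y_true, y_pred))            # → 0.75
print(round(macro_f1(y_true, y_pred), 4))  # → 0.7778
```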

Future Improvements

Implement data augmentation techniques
Explore more advanced OCR methods
Tune hyperparameters with methods such as Bayesian optimization

Contact

For any questions or issues, please open an issue in the GitHub repository.

