STAGE 4

Description: Data Merging and Data Analysis

Entity type: Books

Data Sources: Amazon and Booksamillion

Data Analysis: OLAP Exploration

LINKS-
CSV file storing Table E
Matches between Tables A and B
Python script to merge the tables A and B
Python Script for OLAP analysis
Project report


STAGE 3

Description: Entity Matching

Entity type: Books

Data Sources: Amazon and Booksamillion

LINKS-
DATA
CODE
Project report
Jupyter Notebook


STAGE 2

Description: Web Data extraction

Entity type: Books

Data Sources: Amazon and Booksamillion

LINKS-
DATA
CODE
Project Report


STAGE 1

Description: Person Name Extraction from raw data about Baseball match reviews

Entity Type: Person name

Dataset: Baseball (MLB) commentaries.

Markup Style: <>Person Name</>

LINKS-
Complete dataset
Training dataset (Set I)
Test dataset (Set J)
Code
Compressed file
Project Report


To run the project, execute the following command: python Classifier.py