SNDT WOMEN'S UNIVERSITY

BMK Knowledge Resource Centre

Vithaldas Vidyavihar, Juhu Tara Road,
Santacruz (West) Mumbai - 400049

Optimized Template Detection and Extraction Algorithm for Web Scraping Web Pages

Gaurav Gupta

Optimized Template Detection and Extraction Algorithm for Web Scraping Web Pages - P.145-158


Clustering
Document Object Model (DOM) tree
Web Extraction
Template Detection