E-Commerce Technology for Dynamic E-Business, IEEE International Conference on

Abstract

The problem of extracting data from the deep web can be divided into two tasks: crawling the client-side deep web and crawling the server-side deep web. This paper defines an architecture and a set of related techniques for accessing information in the client-side deep web. Doing so involves handling JavaScript, non-standard session maintenance mechanisms, client-side redirections, pop-up menus, and similar features. Our work uses current browser APIs as building blocks and leverages them to implement novel crawling models and algorithms.
