The connectivity sonar, Proceedings of the fourteenth ACM conference on Hypertext and hypermedia , HYPERTEXT '03, 2003. ,
DOI : 10.1145/900051.900060
Searching for hidden-Web databases, WebDB, 2005. ,
An adaptive crawler for locating hidden-web entry points, WWW, 2007. ,
A training algorithm for optimal margin classifiers, Proceedings of the fifth annual workshop on Computational learning theory , COLT '92, 1992. ,
DOI : 10.1145/130385.130401
iRobot, Proceeding of the 17th international conference on World Wide Web , WWW '08, 2008. ,
DOI : 10.1145/1367497.1367558
Focused crawling: a new approach to topic-specific Web resource discovery, Computer Networks, vol.31, issue.11-16, pp.3111-3127, 1999. ,
DOI : 10.1016/S1389-1286(99)00052-3
Path sharing and predicate evaluation for high-performance XML filtering, ACM Transactions on Database Systems, vol.28, issue.4, 2003. ,
DOI : 10.1145/958942.958947
OXPath: A language for scalable, memory-efficient data extraction from web applications, p.4, 2011. ,
The volume and evolution of web page templates, Special interest tracks and posters of the 14th international conference on World Wide Web , WWW '05, 2005. ,
DOI : 10.1145/1062745.1062763
Board Forum Crawling: A Web Crawling Method for Web Forum, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06), 2006. ,
DOI : 10.1109/WI.2006.52
Web application description language. http://www.w3.org/Submission/wadl/. [13] International Business Times, 2011. ,
Svms for the Blogosphere: Blog Identification and Splog Detection, AAAI, 2006. ,
Coarse-grained classification of web sites by their structural properties, Proceedings of the eighth ACM international workshop on Web information and data management , WIDM '06, 2006. ,
DOI : 10.1145/1183550.1183559
Classifying web sites, Proceedings of the 16th international conference on World Wide Web , WWW '07, 2007. ,
DOI : 10.1145/1242572.1242736
A rule-based query language for HTML, DASFAA, 2001. ,
An improved training algorithm for support vector machines, Neural Networks for Signal Processing VII. Proceedings of the 1997 IEEE Signal Processing Society Workshop, 1997. ,
DOI : 10.1109/NNSP.1997.622408
Building light-weight wrappers for legacy Web data-sources using W4F, VLDB, 1999. ,
Wraplet: Wrapping Your Web Contents with a Lightweight Language, 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System, 2007. ,
DOI : 10.1109/SITIS.2007.135
Declarative information extraction using Datalog with embedded extraction predicates, VLDB, 2007. ,
Incremental crawling with Heritrix, IWAW, 2005. ,
On design of browser-oriented data extraction system and plug-ins, JMST, 2010. ,
Joint optimization of wrapper generation and template detection, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining , KDD '07, 2007. ,
DOI : 10.1145/1281192.1281287