Finding pages on the unarchived Web