
The Deep Web

May 15, 2015

What is the Deep Web?


Also known as the Invisible Web
Contents from thousands of specialized, searchable databases
Contents from web pages that search engines cannot index

Searching the (regular) Internet

More than 170 million web sites made up of billions of web pages/documents
More than 4.5 billion queries/searches per month
Payment schemes
Pay for inclusion: a business pays a fee to be included in the index
Pay for position: sites pay to be ranked higher in the results
Pay per click: keywords are auctioned to advertisers
Pay per action: advertisers pay only when a click results in an action

Payment schemes: Consequences
Results may come from sites that can pay to be listed:
Results may not be the most appropriate.
Nonprofit sites are listed but not given priority.

Searches result in pop-up advertisements
Must use more than one search tool
Deep web sites aren't included in the indexes of search tools
4

Google vs. the Deep Web


Search engines need static and stable links to pages in order to index them
Google indexes 20% of the web, missing 80% of Internet content
Google does not typically index:

Proprietary sites
Sites with scripts
Dynamic sites
Sites blocked by search engine policy
Sites with special formats
Searchable databases
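
As a rough illustration of the "static and stable links" point, the sketch below flags URLs that look dynamically generated. It is only a heuristic under stated assumptions: the function name `looks_dynamic`, the extension list, and the `example.org` URL are hypothetical and not from the slides, and real search engines use far more elaborate rules.

```python
from urllib.parse import urlparse

# Assumption: these extensions usually indicate server-side scripts.
# The list is illustrative, not exhaustive.
DYNAMIC_EXTENSIONS = (".cfm", ".php", ".asp", ".aspx", ".jsp", ".cgi")


def looks_dynamic(url: str) -> bool:
    """Return True if the URL has a query string or a script-like extension."""
    parsed = urlparse(url)
    has_query = bool(parsed.query)
    has_script_ext = parsed.path.lower().endswith(DYNAMIC_EXTENSIONS)
    return has_query or has_script_ext


if __name__ == "__main__":
    examples = [
        "http://www.victorianweb.org/",
        "http://rpo.library.utoronto.ca/display/index.cfm",
        "http://example.org/search?q=chaucer",  # hypothetical database query
    ]
    for url in examples:
        label = "looks dynamic" if looks_dynamic(url) else "looks static"
        print(f"{url}: {label}")
```

A URL flagged here is not necessarily missing from a search index; the check only shows why database-driven pages are harder for a crawler to reach through fixed links.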

How big is the Deep Web?


550 billion documents
500x the content of the surface Web
Internet search tools search only 20% of available content.
Deep web is not Internet2

Why use the Deep Web?


Higher quality sources
Selected and organized by subject experts

Dynamic display
Customized data sets
Some data is visual and not word searchable

What's in the Deep Web?

Searchable databases
Downloadable files & spreadsheets
Image and multi-media files
Data sets
Various file formats such as .pdf, .xls, and .mdb
Lots of government information
8

Deep Web Collections


Web Lens:
http://www.weblens.org/scholar.html
Info Mine:
http://infomine.ucr.edu/
Internet Archive:
http://www.archive.org/index.php

How to Keep Track of the Deep Web

Subscribe to:
Listserv
Information Sources Blog
Discussion Board

Read:
The Periodical Literature

Consult:
Librarians
Faculty
Colleagues
10

Deep Web Sites for English/British Lit

1. Middle English Dictionary
http://quod.lib.umich.edu/m/med/
2. University of VA E-Text English Language Resources
http://etext.lib.virginia.edu/collections/languages/english/
3. The New Chaucer Society
http://artsci.wustl.edu/~chaucer/
11

Deep Web Sites for English/British Lit

4. The Internet Public Library Online Literary Criticism Collection
http://www.ipl.org/div/litcrit
5. Romanticism on the Net
http://www.ron.umontreal.ca/
6. The Brontë Sisters Web
http://www.lang.nagoya-u.ac.jp/~matsuoka/Bronte.html

12

Deep Web Sites for English/British Lit

7. Mr. William Shakespeare and the Internet
http://shakespeare.palomar.edu
8. The Voice of the Shuttle
http://vos.ucsb.edu
9. The Victorian Web
http://www.victorianweb.org/
13

Deep Web Sites for English/British Lit

10. The Rossetti Archive
http://www.rossettiarchive.org/
11. Introductory Guide to Critical Theory
http://www.cla.purdue.edu/academic/engl/theory/
12. Anglo-Saxon England Guide to Online Resources
http://www.theorb.net/encyclop/early/pre1000/asindex.html
14

Deep Web Sites for English/British Lit

13. Representative Poetry Online
http://rpo.library.utoronto.ca/display/index.cfm
14. The Poetry Archive
http://www.poetryarchive.org
15. Open Source Shakespeare
www.opensourceshakespeare.org/
15

Deep Web Sites for English/British Lit

16. Blake Digital Text Project
www.english.uga.edu/~wblake/home1.html
17. The Blake Archive
www.blakearchive.org/

16

Deep Web Exercise


Working as a group:
Annotate the links on slides 11-16.

Post your annotations, comments, and searching tips to the British Lit Deep Web Wiki.
Connect to the Research Wikis link from the library's homepage:
www.cabrini.edu/library
17