- Tom Sawyer Adventure Series
- The Adventures of Tom Sawyer
- The Adventures of Huckleberry Finn
- Tom Sawyer Abroad
- Tom Sawyer Detective
Download the Tom Sawyer Adventure Series in TXT format (fragments ~= chapters): TomSawyerTXT.zip.
Download the Tom Sawyer Adventure Series (fragments ~= chapters) annotated using the Calais Web service (input: RAW TEXT, output: CalaisSimple OutputFormat) and linked with DBpedia and GeoNames data: TomSawyerTXT[RAW-Simple], or browse the files.
Download the detailed log file log.txt for the Tom Sawyer Adventure Series.
Stats for DBpedia retrieval:
* number of DBpedia queries is: 646
  + number of person queries: 646
    - number of persons found: 118
    - number of persons not found: 524
Stats for GeoNames retrieval:
* number of GeoNames queries is: 138
  + number of city queries: 73
    - number of cities found: 69
    - number of cities not found: 4
    - number of cities with broader relationship found: 0
    - number of cities with broader relationship not found: 0
  + number of country queries: 53
    - number of countries found: 53
    - number of countries not found: 0
  + number of continent queries: 12
    - number of continents found: 12
    - number of continents not found: 0
- Finished processing, arguments used:
+ -c, --calais
+ -d, --dbpedia
+ -g, --geonames
+ -i, --idrank
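The retrieval stats above can be turned into hit rates with a few lines of Python. This is only a sketch: the dictionary layout and the `summarize` helper are illustrative assumptions and are not part of the tool that produced the log. The numbers are copied from the log as printed. It also shows that the DBpedia "found" and "not found" figures do not add up to the number of queries; the log does not say why.

```python
# Figures copied verbatim from the retrieval stats in the log.
# The dictionary layout is an assumption made for this example.
stats = {
    "DBpedia persons": {"queries": 646, "found": 118, "not_found": 524},
    "GeoNames cities": {"queries": 73, "found": 69, "not_found": 4},
    "GeoNames countries": {"queries": 53, "found": 53, "not_found": 0},
    "GeoNames continents": {"queries": 12, "found": 12, "not_found": 0},
}


def summarize(entry):
    """Return (hit rate, queries not covered by found + not_found)."""
    hit_rate = entry["found"] / entry["queries"]
    unaccounted = entry["queries"] - entry["found"] - entry["not_found"]
    return hit_rate, unaccounted


for name, entry in stats.items():
    rate, gap = summarize(entry)
    print(f"{name}: {rate:.1%} found, {gap} unaccounted")
```

With the figures above, roughly 18% of the DBpedia person queries and roughly 95% of the GeoNames city queries returned a match, and 4 DBpedia queries fall in neither the found nor the not-found bucket.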
LEGEND: (unique by Calais - i.e. we may have the same entity occurring in different chapters)
Person [unique: 45, count: 1444]
Position [unique: 27, count: 66]
Location [unique: 10, count: 11]: City [unique: 3, count: 4], ProvinceOrState [unique: 6, count: 6], Country [unique: 0, count: 0], Continent [unique: 1, count: 1]
Misc [unique: 24, count: 39]
All [unique: 96, count: 1549]
BILL WITHERS
(freq: 12, ch: 2)




93-ch06
(
count="2"
relevance="0.272"
)
93-ch12
(
count="10"
relevance="0.444"
)
BRACE
(freq: 2, ch: 1)
93-ch03
(count="2"
relevance="0.103"
)
BRACE DUNLAP
(freq: 90, ch: 2)


93-ch02
(
count="26"
relevance="0.384"
)
93-ch12
(
count="64"
relevance="0.577"
)
BRUCE DUNLAP
(freq: 6, ch: 1)


93-ch12
(
count="6"
relevance="0.113"
)
BUD DIXON
(freq: 34, ch: 2)


93-ch04
(
count="15"
relevance="0.476"
)
93-ch08
(
count="19"
relevance="0.342"
)
HAL CLAYTON
(freq: 91, ch: 3)


93-ch04
(
count="26"
relevance="0.425"
)
93-ch05
(
count="64"
relevance="0.963"
)
93-ch08
(
count="1"
relevance="0.099"
)
JACK
(freq: 5, ch: 1)
93-ch06
(count="5"
relevance="0.525"
)
JACK WITHERS
(freq: 34, ch: 1)


93-ch12
(
count="34"
relevance="0.418"
)
JAKE
(freq: 1, ch: 1)
93-ch12
(count="1"
relevance="0.054"
)
JAKE DUNLAP
(freq: 49, ch: 5)


93-ch03
(
count="3"
relevance="0.492"
)
93-ch04
(
count="14"
relevance="0.430"
)
93-ch06
(
count="6"
relevance="0.579"
)
93-ch08
(
count="1"
relevance="0.106"
)
93-ch09
(
count="25"
relevance="0.491"
)
JEFF HOOKER
(freq: 18, ch: 1)


93-ch10
(
count="18"
relevance="0.624"
)
JIM LANE
(freq: 75, ch: 3)


93-ch06
(
count="8"
relevance="0.394"
)
93-ch07
(
count="17"
relevance="0.617"
)
93-ch12
(
count="50"
relevance="0.536"
)
JUBITER DUNLAP
(freq: 273, ch: 4)


93-ch02
(
count="32"
relevance="0.374"
)
93-ch03
(
count="39"
relevance="0.538"
)
93-ch11
(
count="83"
relevance="0.886"
)
93-ch12
(
count="119"
relevance="0.676"
)
LEM
(freq: 8, ch: 1)
93-ch12
(count="8"
relevance="0.440"
)
LEM BEEBE
(freq: 2, ch: 2)


93-ch06
(
count="1"
relevance="0.115"
)
93-ch12
(
count="1"
relevance="0.167"
)
POOR BENNY
(freq: 24, ch: 1)


93-ch12
(
count="24"
relevance="0.516"
)
SAM COOPER
(freq: 7, ch: 1)


93-ch12
(
count="7"
relevance="0.254"
)
SHERIFF
(freq: 4, ch: 1)
93-ch12
(count="4"
relevance="0.407"
)
SILAS
THEM
(freq: 29, ch: 1)
93-ch11
(count="29"
relevance="0.712"
)
STEVE NICKERSON
(freq: 1, ch: 1)


93-ch09
(
count="1"
relevance="0.078"
)
TOM
(freq: 92, ch: 2)
93-ch05
(count="1"
relevance="0.050"
)
93-ch08
(count="91"
relevance="0.869"
)
TOM SAWYER
(freq: 587, ch: 8)




93-ch01
(
count="1"
relevance="0.714"
)
93-ch02
(
count="79"
relevance="0.847"
)
93-ch03
(
count="70"
relevance="0.867"
)
93-ch04
(
count="38"
relevance="0.785"
)
93-ch07
(
count="64"
relevance="0.928"
)
93-ch09
(
count="92"
relevance="0.931"
)
93-ch10
(
count="87"
relevance="0.942"
)
93-ch12
(
count="156"
relevance="0.776"
)
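Each entry in the person index above carries a total frequency (`freq`) and a chapter count (`ch`) that are consistent with summing the per-chapter `count` values. The sketch below reproduces two entries from this page. The `mentions` tuples and the `build_index` helper are hypothetical, and merging names by upper-casing them is an assumption inferred from the page (for example, "JAKE DUNLAP" and "Jake Dunlap" share one index entry), not a documented behaviour of the original tool.

```python
from collections import defaultdict

# (chapter file, entity name as returned by Calais, mention count),
# copied from the per-chapter results on this page.
mentions = [
    ("93-ch06", "Bill Withers", 2),
    ("93-ch12", "Bill Withers", 10),
    ("93-ch03", "JAKE DUNLAP", 3),
    ("93-ch04", "Jake Dunlap", 14),
    ("93-ch06", "Jake Dunlap", 6),
    ("93-ch08", "Jake Dunlap", 1),
    ("93-ch09", "Jake Dunlap", 25),
]


def build_index(mentions):
    """Aggregate per-chapter counts into {NAME: (freq, number of chapters)}."""
    freq = defaultdict(int)
    chapters = defaultdict(set)
    for chapter, name, count in mentions:
        key = name.upper()  # assumption: names are merged case-insensitively
        freq[key] += count
        chapters[key].add(chapter)
    return {key: (freq[key], len(chapters[key])) for key in freq}


index = build_index(mentions)
for name, (total, n_chapters) in sorted(index.items()):
    print(f"{name} (freq: {total}, ch: {n_chapters})")
```

This prints `BILL WITHERS (freq: 12, ch: 2)` and `JAKE DUNLAP (freq: 49, ch: 5)`, matching the index entries. Note that short forms such as "JAKE" or "TOM" stay separate entries under this scheme, as they do in the index.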
93-ch01[0-23]-calais.rdf
Person:
TOM SAWYER
(
count="1"
relevance="0.714"
)




Position:
DETECTIVE
(count="1"
relevance="0.714"
)
93-ch02[24-9296]-calais.rdf
Continent:
America
(
count="1"
relevance="0.307"
)




Facility: Silas's farm
(count="5"
relevance="0.612"
)
MedicalCondition: fever
(count="2"
relevance="0.309"
)
MedicalCondition: ache
(count="1"
relevance="0.302"
)
Person:
Tom Sawyer
(
count="79"
relevance="0.847"
)




Person:
Jubiter Dunlap
(
count="32"
relevance="0.374"
)


Person:
Brace Dunlap
(
count="26"
relevance="0.384"
)


Position:
head
(count="1"
relevance="0.211"
)
Position:
school teacher
(count="1"
relevance="0.106"
)
Position:
preacher
(count="1"
relevance="0.046"
)
ProvinceOrState:
Mississippi
(
count="1"
relevance="0.305"
normalized="Mississippi,United States"
)
Topics:
()
93-ch03[9297-17489]-calais.rdf
City:
St. Louis
(
count="1"
relevance="0.308"
)




IndustryTerm: machinery
(count="1"
relevance="0.105"
)
Person:
Tom Sawyer
(
count="70"
relevance="0.867"
)




Person:
Jubiter Dunlap
(
count="39"
relevance="0.538"
)


Person:
JAKE DUNLAP
(
count="3"
relevance="0.492"
)


Person:
Brace
(count="2"
relevance="0.103"
)
Position:
waiter
(count="2"
relevance="0.330"
)
Position:
chair
(count="1"
relevance="0.041"
)
ProvinceOrState:
Iowa
(
count="1"
relevance="0.271"
normalized="Iowa,United States"
)
ProvinceOrState:
Louisiana
(
count="1"
relevance="0.308"
normalized="Louisiana,United States"
)
Topics:
()
93-ch04[17490-28571]-calais.rdf
City:
St. Louis
(
count="2"
relevance="0.282"
)




Currency: USD
(count="2"
relevance="0.270"
)
IndustryTerm: steel plate
(count="1"
relevance="0.062"
)
Person:
Tom Sawyer
(
count="38"
relevance="0.785"
)




Person:
Hal Clayton
(
count="26"
relevance="0.425"
)


Person:
Bud Dixon
(
count="15"
relevance="0.476"
)


Person:
Jake Dunlap
(
count="14"
relevance="0.430"
)


Position:
hotel clerk
(count="1"
relevance="0.241"
)
Position:
guard
(count="1"
relevance="0.127"
)
Topics:
()
93-ch05[28572-36133]-calais.rdf
IndustryTerm: machinery
(count="2"
relevance="0.211"
)
Person:
Hal Clayton
(
count="64"
relevance="0.963"
)


Person:
Tom
(count="1"
relevance="0.050"
)
Position:
chair
(count="1"
relevance="0.310"
)
ProvinceOrState:
Missouri
(
count="1"
relevance="0.311"
normalized="Missouri,United States"
)
ProvinceOrState:
Iowa
(
count="1"
relevance="0.311"
normalized="Iowa,United States"
)
Topics:
()
93-ch06[36134-41179]-calais.rdf
IndustryTerm: machinery
(count="1"
relevance="0.325"
)
Person:
Jim Lane
(
count="8"
relevance="0.394"
)


Person:
Jake Dunlap
(
count="6"
relevance="0.579"
)


Person:
Jack
(count="5"
relevance="0.525"
)
Person:
Bill Withers
(
count="2"
relevance="0.272"
)




Person:
Lem Beebe
(
count="1"
relevance="0.115"
)


Topics:
()
93-ch07[41180-51017]-calais.rdf
Currency: USD
(count="3"
relevance="0.406"
)
IndustryTerm: paint
(count="1"
relevance="0.306"
)
Person:
Tom Sawyer
(
count="64"
relevance="0.928"
)




Person:
Jim Lane
(
count="17"
relevance="0.617"
)


Topics:
()
93-ch08[51018-58964]-calais.rdf
Event: Natural Disaster
(count="1"
)
Person:
Tom
(count="91"
relevance="0.869"
)
Person:
Bud Dixon
(
count="19"
relevance="0.342"
)


Person:
Hal Clayton
(
count="1"
relevance="0.099"
)


Person:
Jake Dunlap
(
count="1"
relevance="0.106"
)


Position:
head
(count="1"
relevance="0.307"
)
Topics:
()
93-ch09[58965-68260]-calais.rdf
Person:
Tom Sawyer
(
count="92"
relevance="0.931"
)




Person:
Jake Dunlap
(
count="25"
relevance="0.491"
)


Person:
Steve Nickerson
(
count="1"
relevance="0.078"
)


Position:
head
(count="1"
relevance="0.307"
)
Position:
chair
(count="1"
relevance="0.298"
)
Position:
gray head
(count="1"
relevance="0.288"
)
Topics:
()
93-ch10[68261-78196]-calais.rdf
Person:
Tom Sawyer
(
count="87"
relevance="0.942"
)




Person:
Jeff Hooker
(
count="18"
relevance="0.624"
)


Position:
blacksmith
(count="3"
relevance="0.418"
)
Position:
chair
(count="1"
relevance="0.044"
)
Topics:
()
93-ch11[78197-84259]-calais.rdf
Event: Judicial Event
(count="1"
)
Person:
Jubiter Dunlap
(
count="83"
relevance="0.886"
)


Person:
SILAS
THEM
(count="29"
relevance="0.712"
)
Position:
sheriff at the door
(count="1"
relevance="0.069"
)
Position:
chair
(count="1"
relevance="0.308"
)
ProvinceOrState:
Arkansaw
(
count="1"
relevance="0.069"
)
Topics:
()
93-ch12[84260-120839]-calais.rdf
City:
St. Louis
(
count="1"
relevance="0.179"
)




Currency: USD
(count="6"
relevance="0.323"
)
Person:
TOM SAWYER
(
count="156"
relevance="0.776"
)




Person:
Jubiter Dunlap
(
count="119"
relevance="0.676"
)


Person:
Brace Dunlap
(
count="64"
relevance="0.577"
)


Person:
Jim Lane
(
count="50"
relevance="0.536"
)


Person:
Jack Withers
(
count="34"
relevance="0.418"
)


Person:
Poor Benny
(
count="24"
relevance="0.516"
)


Person:
Bill Withers
(
count="10"
relevance="0.444"
)




Person:
Lem
(count="8"
relevance="0.440"
)
Person:
Sam Cooper
(
count="7"
relevance="0.254"
)


Person:
Bruce Dunlap
(
count="6"
relevance="0.113"
)


Person:
Sheriff
(count="4"
relevance="0.407"
)
Person:
Jake
(count="1"
relevance="0.054"
)
Person:
Lem Beebe
(
count="1"
relevance="0.167"
)


Position:
judge
(count="17"
relevance="0.585"
)
Position:
lawyer
(count="11"
relevance="0.467"
)
Position:
Sheriff
(count="8"
relevance="0.481"
)
Position:
chair
(count="3"
relevance="0.525"
)
Position:
preacher
(count="2"
relevance="0.251"
)
Position:
regular lawyer
(count="1"
relevance="0.267"
)
Position:
detective
(count="1"
relevance="0.078"
)
Position:
back-settlement lawyer
(count="1"
relevance="0.300"
)
Position:
preacher at that.
(count="1"
relevance="0.248"
)
Position:
lawyer for the prostitution got up and begun
(count="1"
relevance="0.300"
)
PublishedMedium: the time
(count="1"
relevance="0.222"
)
Topics:
()