Footnotes
Problem: footnotes in text and web pages are not cleanly delimited or explicitly tied to the data they qualify, which makes them hard to place in a database
Andrew Philpot, Senior Programmer, ISI
Jose-Luis Ambite, Research Scientist, ISI
Vasileios Hatzivassiloglou, Research Scientist, Columbia
Jay Sandhaus, Graduate Research Assistant, Columbia
- Current progress:
  - automated extraction of footnotes from text and HTML
- Next steps:
  - automatic determination of footnote scope
  - identification of the concept(s) affected by a footnote, and of the specific ways that footnotes alter concept semantics
- Current work: extracted footnotes from HTML tables (some converted from PDF) and wrapped them as sources for SIMS
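
To illustrate the kind of extraction described above, here is a minimal sketch in Python. It is not the project's extractor. It assumes, purely for illustration, that footnote markers appear as <sup> elements inside table cells and that footnote texts appear in <p> elements outside the table, each starting with the same marker in a <sup>. The names FootnoteTableParser, extract_footnoted_cells, and SAMPLE are hypothetical. Real pages, especially those converted from PDF, are usually far less regular.

```python
from html.parser import HTMLParser


class FootnoteTableParser(HTMLParser):
    """Collects table cells and footnote texts, linked by <sup> markers.

    Assumed layout (illustration only): markers are <sup> elements inside
    <td>/<th>; footnote texts are <p> elements outside any <table>.
    """

    def __init__(self):
        super().__init__()
        self.cells = []      # (row, col, text, [markers])
        self.footnotes = {}  # marker -> footnote text
        self._row = -1
        self._col = -1
        self._in_table = False
        self._in_cell = False
        self._in_note = False
        self._in_sup = False
        self._text = []
        self._markers = []

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._in_table = True
        elif tag == "tr":
            self._row += 1
            self._col = -1
        elif tag in ("td", "th"):
            self._col += 1
            self._in_cell = True
            self._text, self._markers = [], []
        elif tag == "p" and not self._in_table:
            self._in_note = True
            self._text, self._markers = [], []
        elif tag == "sup":
            self._in_sup = True

    def handle_endtag(self, tag):
        if tag == "table":
            self._in_table = False
        elif tag in ("td", "th") and self._in_cell:
            text = "".join(self._text).strip()
            self.cells.append((self._row, self._col, text, self._markers))
            self._in_cell = False
        elif tag == "p" and self._in_note:
            text = "".join(self._text).strip()
            for marker in self._markers:
                self.footnotes[marker] = text
            self._in_note = False
        elif tag == "sup":
            self._in_sup = False

    def handle_data(self, data):
        if not (self._in_cell or self._in_note):
            return
        if self._in_sup:
            marker = data.strip()
            if marker:
                self._markers.append(marker)
        else:
            self._text.append(data)


def extract_footnoted_cells(html):
    """Return one record per table cell that carries a footnote marker."""
    parser = FootnoteTableParser()
    parser.feed(html)
    parser.close()
    return [
        {"row": r, "col": c, "value": text,
         "notes": [parser.footnotes.get(m, "") for m in markers]}
        for (r, c, text, markers) in parser.cells
        if markers
    ]


# Hypothetical sample input, not taken from the project's data.
SAMPLE = """
<table>
<tr><th>Year</th><th>Output</th></tr>
<tr><td>1999</td><td>120<sup>1</sup></td></tr>
</table>
<p><sup>1</sup> Preliminary estimate.</p>
"""

if __name__ == "__main__":
    for record in extract_footnoted_cells(SAMPLE):
        print(record)
```

Each record keeps the cell's row and column alongside the footnote text, so the footnote can be stored in a database next to the value it qualifies. Deciding whether a footnote applies to a single cell, a whole row or column, or the entire table is the scope question listed under next steps, and this sketch does not attempt it.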