Information integration in SIMS: 2
Manually, for pdf, plain text, footnotes
Automatically, using parsing rules based on landmarks in html pages (Ariadne project)
Automated work will be incorporated and extended...
Give user equal access to data, metadata, and footnotes
Support for source combination and query optimization:
- Join, Union, Selection, interpreted predicates
Challenges:
- approximation: handle similar domain model concepts
- aggregation: reformulate data of various granularities