Tricky issues
Sources are not really databases!
- Legacy systems
- Limited access patterns
- (Can't ask a white-pages source for the list of all numbers)
- Limited local processing power
- Typically only selections (on certain attributes) are supported
- Sources are autonomous
- Unregulated data overlap
- Lack of full statistics on the sources
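
The limited-access-pattern and selection-only points above can be illustrated with a small sketch. Everything here is hypothetical and invented for illustration: the `WhitePagesSource` class, its `lookup` method, the `numbers_in_city` helper, and the sample rows. It assumes a source that answers only selections on `name`, so any other filtering has to happen at the mediator.

```python
class UnsupportedQuery(Exception):
    """Raised when a query does not match the source's access pattern."""


class WhitePagesSource:
    """Hypothetical source: supports only a selection on 'name'."""

    def __init__(self, rows):
        self._rows = list(rows)

    def lookup(self, name=None):
        # The access pattern requires 'name' to be bound; the source
        # refuses to enumerate all of its entries.
        if name is None:
            raise UnsupportedQuery("'name' must be bound")
        return [row for row in self._rows if row["name"] == name]


def numbers_in_city(source, names, city):
    """Mediator-side plan: one source call per known name, then filter
    on 'city' locally, since the source cannot select on that attribute."""
    phones = []
    for name in names:
        for row in source.lookup(name=name):
            if row["city"] == city:
                phones.append(row["phone"])
    return phones


# Made-up sample data, for illustration only.
source = WhitePagesSource([
    {"name": "Ann", "city": "Tempe", "phone": "555-0101"},
    {"name": "Bob", "city": "Mesa", "phone": "555-0102"},
    {"name": "Ann", "city": "Mesa", "phone": "555-0103"},
])
tempe_numbers = numbers_in_city(source, ["Ann", "Bob"], "Tempe")
```

Note that the mediator needs the list of names from somewhere else (for example, another source), which is exactly why such binding restrictions complicate query planning.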