Mark Logic’s own Stephen Buxton, co-author of the definitive tome, Querying XML, has recently published an article in IT Pro (a publication of the IEEE Computer Society) entitled “Beyond Search: Content Applications.”
Here is a link to the article (subscription required). If you press the link you can either view the abstract or buy the article for $19. Here’s a link to the editor’s introduction of the issue (free), where he says:
“Stephen Buxton’s article on XML content servers describes the unique capabilities of this form of repository system and the extreme precision and information extraction that it can achieve. The server’s content of unstructured text is richly tagged, usually by inflow entity extractors or taxonomies. This provides a high degree of semantic quality and makes high relevancy search and disambiguation possible. Search, as well as other applications, can be developed to sit atop the server and take full advantage of the metadata. In this way, the enterprise can benefit from true information extraction in search as well as in other applications requiring high precision and a degree of semantic awareness.”
In the article Buxton differentiates enterprise search engines from XML content servers as candidate platforms for content applications.
He also discusses several example content applications, including:
- The Oxford University Press African American Studies Center, an online product for social sciences libraries and researchers that does extensive content integration and repurposing
- O’Reilly Media’s SafariU, a custom publishing system that enables professors to build custom books, online through a web interface with printed versions shipped to the campus bookstore in about 2 weeks
- Elsevier’s PathConsult, a highly contextual application designed for pathologists in order to assist them in the tricky task of differential diagnosis.
It’s worth the $19 — go ahead and get the article. Heck, it’s cheaper and faster to read than his book!