<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Kellblog &#187; content applications</title>
	<atom:link href="http://kellblog.com/category/content-applications/feed/" rel="self" type="application/rss+xml" />
	<link>http://kellblog.com</link>
	<description>The official blog of Dave Kellogg</description>
	<lastBuildDate>Thu, 09 Feb 2012 19:36:01 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='kellblog.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/8ecfcffdb3cd0948a0c38207c0ca38d6?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Kellblog &#187; content applications</title>
		<link>http://kellblog.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://kellblog.com/osd.xml" title="Kellblog" />
	<atom:link rel='hub' href='http://kellblog.com/?pushpress=hub'/>
		<item>
		<title>Norm Walsh Making The Case For XQuery</title>
		<link>http://kellblog.com/2008/12/15/norm-walsh-making-the-case-for-xquery/</link>
		<comments>http://kellblog.com/2008/12/15/norm-walsh-making-the-case-for-xquery/#comments</comments>
		<pubDate>Mon, 15 Dec 2008 15:45:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[Mark Logic]]></category>
		<category><![CDATA[Markup Languages]]></category>
		<category><![CDATA[XML]]></category>
		<category><![CDATA[XQuery]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2008/12/15/norm-walsh-making-the-case-for-xquery/</guid>
		<description><![CDATA[Mark Logic&#8216;s Norm Walsh recently wrote an article for the Data Conversion Labs website, entitled Making the Case for XQuery. Excerpt: But now that you have an XML repository, what are you going to do with it? What and how &#8230; <a href="http://kellblog.com/2008/12/15/norm-walsh-making-the-case-for-xquery/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4319&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.marklogic.com/" title="Mark Logic" rel="homepage" class="zem_slink">Mark Logic</a>&#8216;s <a href="http://norman.walsh.name/">Norm Walsh</a> recently wrote an article for the <a href="http://www.dclab.com/xquery.asp">Data Conversion Labs</a> website, entitled <a href="http://www.dclab.com/xquery.asp">Making the Case for XQuery</a>.  Excerpt:<br />
<blockquote>But now that you have an <a href="http://en.wikipedia.org/wiki/XML" title="XML" rel="wikipedia" class="zem_slink">XML</a> repository, what are you going to do with it? What and how may you deploy it? Simply having large masses of XML converted data doesn&#8217;t necessarily mean that the data in this form is even useful.</p>
<p>Enter XQuery.</p></blockquote>
<p>  Norm then explains the problem with using other languages with XML documentbases<br />
<blockquote>The problem with other <a href="http://en.wikipedia.org/wiki/Programming_language" title="Programming language" rel="wikipedia" class="zem_slink">programming languages</a> isn&#8217;t that they aren&#8217;t able to process XML, it&#8217;s that they aren&#8217;t able to process XML efficiently. Data has to be converted from XML to the language&#8217;s native data structures. Once converted, it must be manipulated with functions that don&#8217;t understand the underlying model and are, consequently, not always a good fit. This &#8220;<a href="http://en.wikipedia.org/wiki/Impedance_matching" title="Impedance matching" rel="wikipedia" class="zem_slink">impedance mismatch</a>&#8221; causes confusion and can introduce errors. Finally, the programming language structures have to be converted back into XML. Each of these steps is tedious, time consuming, and introduces the possibility of errors. In a sophisticated application, this process may have to occur several times for each XML resource. </p></blockquote>
<p>If all this looks interesting, the full article is <a href="http://www.dclab.com/xquery.asp">here</a>.</p>
<p>
<fieldset>
<legend>Related articles by Zemanta</legend>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://marklogic.blogspot.com/2008/05/norm-walsh-joins-mark-logic.html">Norm Walsh Joins Mark Logic</a></li>
<li class="zemanta-article-ul-li"><a href="http://marklogic.blogspot.com/2008/08/norm-learns-rule-1.html">Norm Learns Rule 1</a></li>
<li class="zemanta-article-ul-li"><a href="http://marklogic.blogspot.com/2008/11/positioning-marklogic-server.html">Positioning MarkLogic Server</a></li>
</ul>
</fieldset>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4319/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4319&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2008/12/15/norm-walsh-making-the-case-for-xquery/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>Mark Logic in EContent Magazine Dynamic Navigation Story</title>
		<link>http://kellblog.com/2008/11/12/mark-logic-in-econtent-magazine-dynamic-navigation-story/</link>
		<comments>http://kellblog.com/2008/11/12/mark-logic-in-econtent-magazine-dynamic-navigation-story/#comments</comments>
		<pubDate>Wed, 12 Nov 2008 20:38:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[content architecture]]></category>
		<category><![CDATA[content delivery]]></category>
		<category><![CDATA[Information and Media]]></category>
		<category><![CDATA[XML server]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2008/11/12/mark-logic-in-econtent-magazine-dynamic-navigation-story/</guid>
		<description><![CDATA[A rather overdue post to highlight that Mark Logic was featured a few months back in an EContent Magazine story entitled Reaping Information: Dynamic Navigation Helps Users (PDF). Excerpts: Delivering information in ways that make the most sense to users &#8230; <a href="http://kellblog.com/2008/11/12/mark-logic-in-econtent-magazine-dynamic-navigation-story/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4291&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A rather overdue post to highlight that Mark Logic was featured a few months back in an <a href="http://www.econtentmag.com/">EContent Magazine</a> story entitled <a href="http://www.marklogic.com/press/EContent-reaping-information-dynamic-navigation.pdf">Reaping Information:  Dynamic Navigation Helps Users</a> (PDF).</p>
<p>Excerpts:<br />
<blockquote>Delivering information in ways that make the most sense to users is a key characteristic of <a href="http://www.marklogic.com/product/marklogic-server.html">MarkLogic Server</a>, an XML Server that allows users to store, manage, manipulate, and deliver information</p></blockquote>
<p>Indeed, a key use-case for MarkLogic is as an information delivery platform.  More:<br />
<blockquote>Media company <a href="http://www.alm.com/alm.asp">ALM</a> uses MarkLogic Server for its enterprise content repository, which holds more than 2 decades worth of news and analysis for and about the legal market.</p></blockquote>
<p><a href="http://www.incisivemedia.com/corporate/news/79">ALM was acquired by Incisive Media</a> a while back but nevertheless remains a customer.  More:<br />
<blockquote>Oxford University Press has organized its reference works on African-Americans into a central repository it calls the <a href="http://www.oxfordaasc.com/public/">African American Studies Center</a> (AASC), which allows researchers the ability to search through images and articles, arranging them in chronological order.</p></blockquote>
<p>AASC is not only a very cool MarkLogic-based application, but also &#8212; perhaps more importantly &#8212; it&#8217;s just one slice of Oxford&#8217;s content.</p>
<p>Once a publisher builds their content application platform, it is relatively easy to take different slices of their content to build new and different information products. For example, <a href="http://www.oxfordislamicstudies.com/">Oxford Islamic Studies Online</a> (OISO)  is built on the same platform as the AASC, and I&#8217;m sure the OISO&#8217;s marginal development cost was reduced because it could leverage the fixed costs invested the development of OUP&#8217;s (MarkLogic-based) publishing platform.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4291/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4291/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4291/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4291/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4291/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4291/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4291/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4291/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4291/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4291/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4291/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4291/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4291/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4291/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4291&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2008/11/12/mark-logic-in-econtent-magazine-dynamic-navigation-story/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>To SaaS or Not To SaaS: That is the Question</title>
		<link>http://kellblog.com/2008/10/01/to-saas-or-not-to-saas-that-is-the-question/</link>
		<comments>http://kellblog.com/2008/10/01/to-saas-or-not-to-saas-that-is-the-question/#comments</comments>
		<pubDate>Wed, 01 Oct 2008 14:11:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[Publishing 2.0]]></category>
		<category><![CDATA[SaaS]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2008/10/01/to-saas-or-not-to-saas-that-is-the-question/</guid>
		<description><![CDATA[[Revised, rewritten, and replacing a post from yesterday] One question we encounter with our Information and Media customers is whether they should buy MarkLogic Server and build an application on top of it, or use a SaaS offering (which may &#8230; <a href="http://kellblog.com/2008/10/01/to-saas-or-not-to-saas-that-is-the-question/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4261&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>[Revised, rewritten, and replacing a post from yesterday]</p>
<p>One question we encounter with our <a href="http://www.outsellinc.com/store/products/766?refid=home">Information and Media</a> customers is whether they should buy <a href="http://www.marklogic.com/product/marklogic-server.html">MarkLogic Server</a> and <span style="font-weight:bold;">build </span>an application on top of it, or use a <a href="http://en.wikipedia.org/wiki/Software_as_a_Service">SaaS</a> offering (which may or may not be based on MarkLogic) and effectively <span style="font-weight:bold;">rent </span>the use of an application to meet their online publishing needs.</p>
<p>The primary arguments in favor of the <span style="font-weight:bold;">rent </span>(SaaS) approach are:
<ul>
<li>You get <span style="font-weight:bold;">up and running faster</span> because you&#8217;re renting the use of an existing application</li>
<li>You have l<span style="font-weight:bold;">ower up-front fees</span> because you need neither to build your application nor buy the hardware/software platform on which to run it</li>
<li>You can <span style="font-weight:bold;">focus on what matters</span> because you are liberated from the nitty-gritty of building and deploying production systems</li>
</ul>
<p>The primary arguments in favor of the <span style="font-weight:bold;">build </span>approach are:
<ul>
<li>You create a unique offering which you can use to <span style="font-weight:bold;">differentiate from your competition</span></li>
<li>Your <span style="font-weight:bold;">costs are potentially lower over the mid-term</span> (SaaS&#8217;s relatively high annual payments reverse the initial savings over a few years; if you don&#8217;t believe me, remember that Wall Street values a dollar of SaaS revenue at about 2-3x a dollar of perpetual revenue)</li>
<li>You create a strategic platform on which you build future applications,  <span style="font-weight:bold;">reducing the marginal cost of experimentation</span> and new product development</li>
</ul>
<p>To me, SaaS is not a religious issue; it&#8217;s a practical one.</p>
<p>While we typically sell our software on a perpetual license basis, we nevertheless are a big user of SaaS solutions at Mark Logic.  We happily use <a href="http://www.salesforce.com/">Salesforce</a> and somewhat less happily use <a href="http://www.netsuite.com/">Netsuite</a>.  I was also a champion of bringing Salesforce into Business Objects, where we became one of their earliest, large enterprise customers. (As I told IT at the time:  if you won&#8217;t treat me as a customer, then I&#8217;ll go find someone who will.)</p>
<p>Turning back to the question of publishers and SaaS, like most questions in business, the answer should derive from strategy.
<ul>
<li>If you are trying to compete solely on the basis of your proprietary content, then you should consider a &#8220;rent&#8221; strategy.</li>
<li>If you are trying to compete on the basis of mixing content and its delivery mechanism, then should consider a &#8220;buy&#8221; strategy.</li>
<li>If you are in between, then you&#8217;ll need to figure out where you are on the continuum and what you&#8217;re willing to trade for what.</li>
</ul>
<p>As I always say, there are two things that money can&#8217;t buy:  love and competitive advantage. Applied here, if you can rent a solution then your competitor down the street can rent it, too, and no amount of application configuration is going to result in competitive advantage (or disadvantage) for either of you.</p>
<p>What does this mean?  It means that SaaS is great for what <a href="http://en.wikipedia.org/wiki/Geoffrey_Moore">Geoffrey Moore</a> calls &#8220;<a href="http://www.dealingwithdarwin.com/theBook/darwinDictionary.php">context</a>&#8221; and rotten for what he calls &#8220;<a href="http://www.dealingwithdarwin.com/theBook/darwinDictionary.php">core</a>.&#8221;  Excerpt from the referred page:
<p><strong></strong></p>
<blockquote><p><strong>Core</strong> &#8211; See <em> <a href="http://www.dealingwithdarwin.com/theBook/darwinDictionary.php#Corecontextanalysis">Core/context analysis</a></em><br />Any activity which creates sustainable differentiation in the target market resulting in premium prices or increased volume. Core management seeks to dramatically outperform all competitors within the domain of core.</p>
<p><strong><a name="Corecontextanalysis" id="Corecontextanalysis"></a></strong>
<p><strong>Context</strong> &#8211; See <em><a href="http://www.dealingwithdarwin.com/theBook/darwinDictionary.php#Corecontextanalysis">Core/context analysis</a></em><br />Any activity which does not differentiate the company from the customers&#8217; viewpoint in the target market. Context management seeks to meet (but not exceed) appropriate accepted standards in as productive a manner as possible. </p>
<p>     <strong></strong><a name="Corecontextanalysis" id="Corecontextanalysis"></a><strong><a name="Corecontextanalysis" id="Corecontextanalysis"></a></strong></p></blockquote>
<p>That&#8217;s why we happily use Salesforce and Netsuite at Mark Logic &#8212; we aren&#8217;t trying to differentiate on the basis of our accounts receiveable or pipeline management systems.  (We are trying to differentiate on technology, market focus, and services excellence.)</p>
<p>So, for publishers
<ul>
<li>The more your basis of competition is ownership of a proprietary content set, the more delivery becomes context, and the more you should consider SaaS</li>
</ul>
<ul>
<li>The more your basis of competition is (1) uniting your content with other content, (2) delivering content in unique in-context ways, and (3) rapid innovation in online product development, the more delivery is core, and the more you should build custom applications (i.e., new information products) on a standardized platform.</li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4261/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4261/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4261/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4261/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4261/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4261/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4261/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4261&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2008/10/01/to-saas-or-not-to-saas-that-is-the-question/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>Lazy XML Enrichment</title>
		<link>http://kellblog.com/2007/08/22/lazy-xml-enrichment/</link>
		<comments>http://kellblog.com/2007/08/22/lazy-xml-enrichment/#comments</comments>
		<pubDate>Wed, 22 Aug 2007 23:29:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[content architecture]]></category>
		<category><![CDATA[content delivery]]></category>
		<category><![CDATA[text analytics]]></category>
		<category><![CDATA[XQuery]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/08/22/lazy-xml-enrichment/</guid>
		<description><![CDATA[One of my big gripes with most content-oriented software is that it requires a big bang approach (see The First Step&#8217;s a Doozy). The basic premise behind most content software is roughly: 1. If you do all this hard work &#8230; <a href="http://kellblog.com/2007/08/22/lazy-xml-enrichment/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4017&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>One of my big gripes with most content-oriented software is that it requires a big bang approach (see <a href="http://marklogic.blogspot.com/2005/11/first-steps-doozy.html">The First Step&#8217;s a Doozy</a>).  The basic premise behind most content software is roughly:</p>
<p>1.  If you do all this hard work to perfectly standardize the schema of your content, perfectly tag it, and possibly perfectly shred it, then</p>
<p>2.  You can do cool stuff like content repurposing, content integration, multi-channel content delivery, and custom publishing.</p>
<p>The problem is, of course, that the first step is lethal.  Many content software projects blow up on the launchpad because they can&#8217;t get beyond step 1. Our first customer had been stuck on step 1 for 18 months with Oracle before they found Mark Logic.  (We loaded their content in a week.) At a recent Federal tradeshow, we had dinner with some folks from Booz Allen who&#8217;d been trying to load to some semi-structured message traffic data into a relational database for months.  We told them to swing by our booth the next day.  Our sales engineer then loaded their content over a cup of coffee while eating a muffin and built a basic application in an hour.  They couldn&#8217;t believe it.</p>
<p>In most companies &#8212; even publishers &#8212; content is a mess.  It&#8217;s in 100 different places in 15 different formats, and each defined format is usually more of an aspiration than a standard.  Once, at a multi-billion dollar publisher one of our technical guys actually found this sentence in some internal documentation:  &#8220;it is believed that this tag is used to &#8230;&#8221;  Only folklore describes the schema.</p>
<p>So when it comes to the general problem of making XML more rich &#8212; i.e., having more tags that indicate more meaning &#8212; many people take the same big-bang approach.  &#8220;Well, step 1 would be to put all the content into a single schema (which alone could kill you) and run it through a dozen different entity, fact, sentiment, concept, summarization &#8220;extractors&#8221; that can markup the content and fragments of it with lots of new and powerful tags (which alone could cost millions).</p>
<p>Again, step 1 becomes lethal.</p>
<p>At Mark Logic we advocate that people consider the opposite approach.  Instead of:
<ul>
<li>Step 1:  make the content perfect so you can enable any application you want to build</li>
<li>Step 2:  build an application</li>
</ul>
<p>We say:
<ul>
<li>Step 1:  figure out the application you want to build</li>
<li>Step 2:  figure out which portions of your markup need to be improved to build that application</li>
<li>Step 3:  improve only that markup, sometimes manually, sometimes with extraction software, and sometimes with heuristics (i.e., rules of thumb) coded in XQuery </li>
<li>Step 4:  build your application and get some business value from it</li>
<li>Step 5:  repeat the process, driven by subsequent application requirements</li>
</ul>
<p>I call this lazy XML enrichment.  You could call it application-driven, as opposed to infrastructure-driven, content cleanup.  I think it&#8217;s an infinitely better approach because it delivers business results faster and eliminates the risk of either never finishing the first step because it&#8217;s impossible, or having funding yanked by the business because it runs out of patience with an IT project that&#8217;s showing no ostensible progress.</p>
<p>At this point, I&#8217;d like to direct those of technical heart to Matt Turner&#8217;s <a href="http://xquery.typepad.com/">Discovering XQuery</a> blog where he provides a detailed post (code included) that shows an example of lazy, heuristic-based XML enrichment, <a href="http://xquery.typepad.com/xquery/2007/08/xquery-and-lazy.html">here</a>.
<ul>
<li>Matt&#8217;s example show lazy enrichment because the only markup he needs for his desired application is related to weapons, so that&#8217;s all he adds.</li>
</ul>
<ul>
<li>Matt&#8217;s example is heuristic-based because he devises a way to find weapons in XQuery, and then use XQuery to tag them as such.</li>
</ul>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/4017/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/4017/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4017/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4017/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4017/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4017&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/08/22/lazy-xml-enrichment/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>How The Web Disrupts the RDBMS World</title>
		<link>http://kellblog.com/2007/08/16/how-the-web-disrupts-the-rdbms-world/</link>
		<comments>http://kellblog.com/2007/08/16/how-the-web-disrupts-the-rdbms-world/#comments</comments>
		<pubDate>Thu, 16 Aug 2007 20:10:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[relational database]]></category>
		<category><![CDATA[XQuery]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/08/16/how-the-web-disrupts-the-rdbms-world/</guid>
		<description><![CDATA[I found an interesting post on The Future of Software minisite run by the GigaOM network, best known for Om Malik and his GigaOM blog. The post is entitled &#8220;Data 2.0: How the Web disrupts our relational database world&#8221; and &#8230; <a href="http://kellblog.com/2007/08/16/how-the-web-disrupts-the-rdbms-world/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4012&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I found an interesting post on <a href="http://future.gigaom.com/">The Future of Software</a> minisite run by the GigaOM network, best known for <a href="http://en.wikipedia.org/wiki/Om_Malik">Om Malik</a> and his <a href="http://gigaom.com/">GigaOM</a> blog.  The post is entitled &#8220;<a href="http://future.gigaom.com/2007/08/10/data-20-how-the-web-disrupts-our-relational-database-world/">Data 2.0:  How the Web disrupts our relational database world</a>&#8221; and is written by <a href="http://www.blogger.com/profile/364187">Nitin Borwankar</a>.</p>
<p>The post begins with:<br /><strong><br />
<blockquote>The great online shift is creating massive amounts of data &#8211; whether it is videos on YouTube or social networking profiles on MySpace. And that data is stored in databases, making them the key component of the new web infrastructure. But managing that information isn’t easy</p></blockquote>
<p></strong>I think he nails the problem statement.  The Web world is changing fast.  And relational databases are having trouble keeping up.<br />
<blockquote>The good news is that database management will be vastly different in the future. In fact, change has already begun; it just isn’t (cliché alert!) “<a href="http://en.wikipedia.org/wiki/William_gibson">evenly distributed</a>” yet.</p></blockquote>
<p>He then goes on to describe some leading examples of companies or problems that are pushing the relational database envelope.
<ol>
<li>Yahoo&#8217;s creation of its own user management software based on BerkeleyDB</li>
<li>Google&#8217;s <a href="http://labs.google.com/papers/mapreduce.html">MapReduce</a></li>
<li>Amazon&#8217;s S3 (simple storage service) and SQS (simple queue service) which externalize operations normally done by a database.</li>
<li>The general use of Lucene, Nutch, and Solr to do indexing of unstructured content, &#8220;something an old relational database cannot do well.&#8221;</li>
<li>The graph-structured data problem (also known as the parts explosion problem) inherent in social networking and which remains an Achilles&#8217; heel for relational databases</li>
</ol>
<p>So while I generally agree with his thesis, the examples cited are basically all technology companies who are able to write their own system-level software to bypass and/or accommodate the limitations of relational databases.</p>
<p>My question is:  what about everybody else?  What are they supposed to do?</p>
<p>My short answer is &#8212; perhaps not shockingly &#8212; MarkLogic.  At MarkLogic, we call Data 2.0 &#8220;content.&#8221;
<ul>
<li>We manage XML natively</li>
<li>We manage graph-structured data easily</li>
<li>We manage, search, storage and index text and XML natively</li>
</ul>
<p>Some companies will always be able to write their own stuff to get around problems.  But the reason MarkLogic exists is provide a commercial DBMS that &#8220;the rest of us&#8221; can use when managing content and building web applications with it. </p>
<p>See this post on <a href="http://marklogic.blogspot.com/2007/05/web-applications-virtues-of-top-to.html">top-to-bottom XML</a> for more.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/4012/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/4012/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4012/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4012/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4012/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4012/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4012/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4012/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4012/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4012/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4012/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4012/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4012/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4012/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4012/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4012/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=4012&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/08/16/how-the-web-disrupts-the-rdbms-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>From Search to Research &#8230; and Content Applications</title>
		<link>http://kellblog.com/2007/07/26/from-search-to-research-and-content-applications/</link>
		<comments>http://kellblog.com/2007/07/26/from-search-to-research-and-content-applications/#comments</comments>
		<pubDate>Thu, 26 Jul 2007 22:53:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/07/26/from-search-to-research-and-content-applications/</guid>
		<description><![CDATA[Here&#8217;s an interesting post on the Read/WriteWeb (RWW) blog, entitled From Search to (Re)Search, Searching for the Google Killer. It&#8217;s definitely worth reading, and the links within it, like this one where a Hakia guy explains quite articulately why Google &#8230; <a href="http://kellblog.com/2007/07/26/from-search-to-research-and-content-applications/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3993&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s an interesting post on the Read/WriteWeb (RWW) blog, entitled <a href="http://www.readwriteweb.com/archives/from_search_to_research.php">From Search to (Re)Search, Searching for the Google Killer.</a></p>
<p>It&#8217;s definitely worth reading, and the links within it, like <a href="http://www.readwriteweb.com/archives/competing_with_google_search.php">this one</a> where a <a href="http://www.hakia.com/">Hakia</a> guy explains quite articulately why Google is unstoppable, and then  unsuccessfully tries to dismiss his own arguments.</p>
<p>I agree with RWW that the Google killer won&#8217;t come from:
<ul>
<li>One-up feature companies.  Engines like Clusty, which add one feature (e.g., dynamic clustering) on top of Google search.</li>
</ul>
<ul>
<li>Vertical search companies.  While the long tail is real, I don&#8217;t believe there will be a long tail of search engines (that&#8217;s the inverse of the concept).  People want relatively few tools that can reach into the long tail of content, products, and information.  They don&#8217;t want a long tail of tools.</li>
</ul>
<ul>
<li>Human search.  Unless you&#8217;re doing real research, the cost model is prohibitive.  </li>
</ul>
<p>The first time I heard the phrase &#8220;research, not search&#8221; was from <a href="http://www.nerac.com/">Nerac</a> CEO <a href="http://www.nerac.com/about-us/management-team/">Kevin Bouley</a>.  Kevin&#8217;s company provides custom research services using a database of content integrated from numerous sources combined with a network of subject-matter experts (SMEs) who use an MarkLogic-based application to assemble custom research reports for clients.  When Kevin says &#8220;research, not search&#8221; he means it.</p>
<p>Nerac uses <a href="http://www.marklogic.com/products">MarkLogic</a> as their content repository and have a built an XQuery application that  enables SMEs to quickly locate information (using our XML search capabilities) and then combine and package that information into a custom research report.  It&#8217;s a very cool service, and while I think of it as &#8220;research, not search&#8221; I certainly don&#8217;t think of it as human-powered search a la <a href="http://www.chacha.com/">Cha Cha</a>.</p>
<p>While I believe that <span style="font-weight:bold;">from search to research</span> is a good direction, I think there is another equally important direction that the RWW omits:  <span style="font-weight:bold;">from search to application</span>, or as we say at Mark Logic &#8220;content application.&#8221;</p>
<p>To me, search is inherently open-ended and context-free.  Applications are not.  If I know you&#8217;re a professor and you want to build a custom textbook, then I can build an application that helps you do that.  And yes, that application will probably include search across a corpus of content.  But search is a feature in the application, not the application itself.</p>
<p>Or, if you&#8217;re a pathologist, I can build you an application that leverages how you work with terabytes of medical content to help you identify cancers more readily.  Search might be a feature within that application, but the application itself is about helping support the process of differential diagnosis.</p>
<p>Content applications know who you are and what you&#8217;re trying to do.  (They&#8217;re role and task aware.)  And you can build them on MarkLogic.  And, in my mind, only a content application has enough unfair competitive advantage to beat Google over time.  A thin vertical search layer?  A better algorithm?  One sexy feature?  No.</p>
<p>But an application that knows who you are, what you&#8217;re trying to to, and leverages a rich (potentially integrated and enriched) contentbase to do so?  Ah, well that&#8217;s no fair.  No search engine can do that.</p>
<p>And that&#8217;s the point.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/3993/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/3993/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/3993/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/3993/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/3993/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/3993/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/3993/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/3993/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/3993/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/3993/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/3993/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/3993/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/3993/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/3993/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/3993/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/3993/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3993&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/07/26/from-search-to-research-and-content-applications/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>The Relevancy Quest</title>
		<link>http://kellblog.com/2007/06/26/the-relevancy-quest/</link>
		<comments>http://kellblog.com/2007/06/26/the-relevancy-quest/#comments</comments>
		<pubDate>Tue, 26 Jun 2007 16:54:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[Internet Search]]></category>
		<category><![CDATA[vertical search]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/06/26/the-relevancy-quest/</guid>
		<description><![CDATA[In the classic book, The Innovator&#8217;s Dilemma, Clayton Christensen concludes that a key reason leading companies fail is because they spend too much energy working on sustaining innovations that continuously improve their products for their existing customers. Seemingly paradoxically, he &#8230; <a href="http://kellblog.com/2007/06/26/the-relevancy-quest/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3978&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In the classic book, <a href="http://www.amazon.com/exec/obidos/tg/detail/-/0060521996/qid=1101756443/sr=8-1/ref=pd_ka_1/102-0228227-9568947?v=glance&amp;s=books&amp;n=507846">The Innovator&#8217;s Dilemma</a>, Clayton Christensen concludes that a key reason leading companies fail is because they spend too much energy working on sustaining innovations that continuously improve their products for their <span>existing </span>customers. Seemingly paradoxically, he points out that these sustaining innovations can involve very advanced and very expensive technology.  That is, it&#8217;s not the nature of the technology used  (e.g., advanced or simple) that causes innovation to be sustaining or disruptive &#8212; it&#8217;s who the technology is designed to serve and in what uses.</p>
<p>I think search vendors need to dust off their copies of The Innovator&#8217;s Dilemma.  Why?  Because, for the most part, they seemed wedged in the following paradigm, which I&#8217;d call the relevancy quest:
<ul>
<li>Search is about grunting a     few keywords</li>
</ul>
<ul>
<li>The answer is a list of links</li>
</ul>
<ul>
<li>The quest is then magically      inducing the most relevant links given a few grunts</li>
</ul>
<p>And it&#8217;s not a bad paradigm.  Heck, it made Google worth $140B and bought Larry and Sergey a nice <a href="http://www.post-gazette.com/pg/05308/600836.stm">767</a>.  But can we do better?</p>
<p>Some folks, like the much-hyped <a href="http://www.powerset.com/">Powerset</a>, think so.  They&#8217;re challenging the <a href="http://www.barneypell.com/archives/2007/02/powerset_series.html">grunting</a> part of the equation, arguing that &#8220;keyword-ese&#8221; is the problem and the solution is natural language.  They seem unphased both by Ask Jeeves&#8217; failure to dominate search and by the more than 20 years of failed attempts to provide natural language interfaces to database data, used for business intelligence (BI).  As I often say, if natural language were the key to BI user interfaces, then <a href="http://www.businessobjects.com/">Business Objects</a> would have been purchased by Microsoft years ago for a pittance and Natural Language Inc.&#8217;s DataTalker would rule BI.  (Instead of the other way around.)</p>
<p>But I respect Powerset because at least they&#8217;re challenging the paradigm and taking a different approach to the problem.  And, while I sure don&#8217;t understand the cost model, I also respect guys like <a href="http://www.chacha.com/">ChaCha </a>because they&#8217;re challenging the paradigm, too.  In ChaCha&#8217;s case, they&#8217;re delivering human-powered search where you can literally chat with a live guide who helps you refine your search.</p>
<p>I can also respect the social search guys, including the recently launched <a href="http://www.mahalo.com/">Mahalo</a>, because they&#8217;re challenging the paradigm as well &#8212; using <a href="http://www.randomhouse.com/features/wisdomofcrowds/">Wisdom of Crowds</a> / Web 2.0 / Wikipedia style collaboration to created &#8220;hand-written results pages&#8221; for topics, such as the always searchable &#8220;<a href="http://www.mahalo.com/Paris_Hilton">Paris Hilton</a>.&#8221;</p>
<p>The folks I have trouble understanding are those on the algorithmic relevancy quest, companies like <a href="http://www.hakia.com/">Hakia</a>, a semantic search vendor (interviewed <a href="http://www.readwriteweb.com/archives/semantic_search_antidote_for_poor_relevancy.php">here</a> by Read/Write Web) whose schtick is <a href="http://company.hakia.com/technology.html">meaning-based search</a>, and who comes complete with a PageRank &#8482; rip-off-name algorithm called SemanticRank &#8482;. Or <a href="http://www.ask.com/">Ask</a> who recently launched a $100M advertising campaign about &#8220;<a href="http://www.thealgorithm.com/">the algorithm</a>&#8220;.  These people remind me of the disk drive manufacturers who invested millions in very advanced technologies for improved 8&#8243; disk drives (to serve their existing customers) all the while missing the market for 5.25&#8243; disk drives required by different customers (i.e., PC manufacturers).</p>
<p>Are the Hakias of the world answering the right question?   Should we be grunting keywords into search boxes and relying on SomethingRank &#8482; to do the best job of determining relevancy? Is the search battle of the future really about &#8220;my rank&#8217;s better than you rank&#8221; or equivalently, &#8220;my PhD&#8217;s smarter than your PhD&#8221;?  Aren&#8217;t these guys fighting the last war?</p>
<p>As usual, I think there are separate answers for Internet and enterprise search.</p>
<p>On the Internet side, sure I think search engines can certainly use more &#8220;magic&#8221; to improve search relevancy.  For example, they can use recent queries and a user profile to impute intent.  They can use dynamic clustering and iterative query refinement (e.g., faceted navigation) to help users incrementally improve the precision of their queries.</p>
<p>More practically, I think vertical search and community sites are a great way of improving search results.  The context of the site you&#8217;re on provides a great clue to what you&#8217;re looking for.  Typing &#8220;Paris Hilton&#8221; into <a href="http://www.expedia.com/">Expedia</a> means you&#8217;re probably looking for a hotel, where typing it <a href="http://www.eonline.com/">EOnLine</a> means you&#8217;re looking for information on the jailed debutante.</p>
<p>Of course, there are a host of Web 2.0 style techniques to improve search like diggs and wikis which can be put to work as well.</p>
<p>Increasingly, our publishing and media customers are going well beyond &#8220;improving search&#8221; and changing the paradigm to &#8220;<a href="http://marklogic.blogspot.com/2007/02/buxton-ieee-article-beyond-search.html">content applications</a>&#8221; &#8212; systems that combine software and content to help specific users accomplish specific tasks.  See Elsevier&#8217;s <a href="http://65.61.35.58/pathology/Start_PathCONSULT_Demo.htm">PathConsult</a> as a concrete example.</p>
<p>On the enterprise search side, I think the answer is different.  As I&#8217;ve often mentioned, on the enterprise side you lack the rich link structure of the web, effectively lobotomizing PageRank and robbing Google of its once-special (and now increasingly gamed and hacked) sauce.</p>
<p>When I look for the answer of how to improve search in an enterprise context, I look back to BI, where we have decades of history to guide us about the quest to enable end-user access to corporate data.
<ul>
<li>Typing SQL (once seriously considered as the answer) failed.   Too complex.  While SQL itself was the great enabler of the BI industry, end users could never code it.</li>
</ul>
<ul>
<li>Creating reports in 4GL languages failed.  Too complex.</li>
</ul>
<ul>
<li>Having other people create reports and deliver them to end users was a begrudging success.  While this created a report treadmill/backlog for IT and buried end-users in too much information, it was probably the most widely used paradigm.</li>
</ul>
<ul>
<li>Natural language interfaces failed.  Too hard to express what you really want.  Too much precision required.  Too much iteration required.</li>
</ul>
<ul>
<li>End users using graphical tools linked directly to the database schema failed.  While these tools hid the complexities of SQL, they failed to hide the complexity of the database schema.</li>
</ul>
<p>It was only when Business Objects invented a graphical, SQL-generating tool that hid all underlying database complexity and enabled users to compose an arbitrary query that the BI market took off.  Simply put, there were two keys:</p>
<p>1.  The ability to phrase an arbitrary query of arbitrary complexity (not a highly constrained search).</p>
<p>2.  The ability to hide the complexity of the database from the underlying user</p>
<p>While no one has yet built a such a tool for an arbitrary XML contentbase (and while I think building one will be hard given the lack of requirement for a defined schema), MarkLogic customers use our product every day to build content applications that generate complex queries against large contentbases, and completely hide XQuery from the end-user.</p>
<p>Simply put, it&#8217;s not about improving search.  It&#8217;<br />
s about delivering query.  That&#8217;s the game-changer.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/3978/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/3978/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/3978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/3978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/3978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/3978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/3978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/3978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/3978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/3978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/3978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/3978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/3978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/3978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/3978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/3978/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3978&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/06/26/the-relevancy-quest/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>The High Cost of Ineffective Search</title>
		<link>http://kellblog.com/2007/03/04/the-high-cost-of-ineffective-search/</link>
		<comments>http://kellblog.com/2007/03/04/the-high-cost-of-ineffective-search/#comments</comments>
		<pubDate>Sun, 04 Mar 2007 16:10:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[Enterprise Search]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/03/04/the-high-cost-of-ineffective-search/</guid>
		<description><![CDATA[Just a quick post to a recent article on the costs associated with ineffective enterprise search. Tidbits include: According to IDC, a company with 1,000 information workers can expect more than $5M in annual wasted salary costs because of poor &#8230; <a href="http://kellblog.com/2007/03/04/the-high-cost-of-ineffective-search/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3938&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Just a quick post to a recent <a href="http://www.cio-today.com/story.xhtml?story_id=032001WANPHC&amp;page=1">article</a> on the costs associated with ineffective enterprise search. </p>
<p>Tidbits include:
<ul>
<li>According to IDC, a company with 1,000 information workers can expect more than $5M in annual wasted salary costs because of poor search.</li>
</ul>
<ul>
<li>A recent survey of 1,000 middle managers found that more than half the information they find during searching is useless.</li>
</ul>
<ul>
<li>According to Butler Group, as much as 10% of a company&#8217;s salary costs are wasted through ineffective search.</li>
</ul>
<ul>
<li>According to <a href="http://www.idc.com/getdoc.jsp?containerId=PRF000110">Sue Feldman</a> of IDC, people spend 9-10 hours per week searching for information and aren&#8217;t successful 1/3 to 1/2 the time.</li>
</ul>
<p>As I always say, there&#8217;s a reason why &#8220;enterprise search sucks&#8221; returns over 1M hits on Google, including posts from luminaries such as <a href="http://weblog.infoworld.com/udell/2006/04/10.html">John Udell</a> and <a href="http://www.cmswatch.com/Trends/643-Improving-Intranet-Search">Tony Byrne</a>.</p>
<p>While Mark Logic is not out to solve the generic enterprise search problem, I have long believed that enterprise search, as a catgory, will become stuck between a rock and a hard place.
<ul>
<li>The rock is the commoditization of the low-end enterprise search market through offerings like the Google Appliance and IBM OmniFind Yahoo Edition.  This will suck the money out of the low end, the generic crawl-and-index market.</li>
</ul>
<ul>
<li>The hard place is DBMSs &#8212; specifically, DBMS-based content applications built to help people in specific roles perform specific tasks.  Some people build these applications today by trying to bolt together an enterprise search engine and a DBMS (e.g., Oracle + Verity or Lucene + MySQL), but increasing I believe people will use XML content servers (special-purpose DBMSs designed to handle content) for this purpose.</li>
</ul>
<p>When you think about it, an inverted keyword index can only help you so much when trying to solve a problem &#8212; even if you gussy it up with taxonomies and sexy extraction technology.  In the end, an application designed to solve a specific problem will trump a souped-up tool every time.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/3938/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/3938/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/3938/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/3938/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/3938/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/3938/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/3938/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/3938/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/3938/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/3938/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/3938/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/3938/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/3938/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/3938/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/3938/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/3938/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3938&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/03/04/the-high-cost-of-ineffective-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>Buxton IEEE Article: Beyond Search, Content Applications</title>
		<link>http://kellblog.com/2007/02/14/buxton-ieee-article-beyond-search-content-applications/</link>
		<comments>http://kellblog.com/2007/02/14/buxton-ieee-article-beyond-search-content-applications/#comments</comments>
		<pubDate>Wed, 14 Feb 2007 22:41:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[Buxton]]></category>
		<category><![CDATA[content applications]]></category>
		<category><![CDATA[Enterprise Search]]></category>
		<category><![CDATA[XML content server]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/02/14/buxton-ieee-article-beyond-search-content-applications/</guid>
		<description><![CDATA[Mark Logic&#8217;s own Stephen Buxton, co-author of the definitive tome, Querying XML, has recently published an article in IT Pro (a publication of the IEEE Computer Society) entitled &#8220;Beyond Search: Content Applications.&#8221; Here is a link to the article (subscription &#8230; <a href="http://kellblog.com/2007/02/14/buxton-ieee-article-beyond-search-content-applications/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3932&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://davidkellogg.files.wordpress.com/2010/09/img.jpg"><img class="alignright size-full wp-image-6775" title="IT Professional Enterprise Search img" src="http://davidkellogg.files.wordpress.com/2010/09/img.jpg?w=500" alt=""   /></a>Mark Logic&#8217;s own Stephen Buxton, co-author of the definitive tome, <a href="http://xqzone.marklogic.com/queryingxmlbook/">Querying XML</a>, has recently published an article in <a href="http://www.computer.org/portal/site/itpro/index.jsp">IT Pro</a> (a publication of the IEEE Computer Society) entitled &#8220;Beyond Search:  Content Applications.&#8221;</p>
<p>Here is a <a href="http://csdl2.computer.org/persagen/DLAbsToc.jsp?resourcePath=/dl/mags/it/&amp;toc=comp/mags/it/2007/01/f1toc.xml&amp;DOI=10.1109/MITP.2007.4">link</a> to the article (subscription required).  If you press the link you can either view the abstract or buy the article for $19.  Here&#8217;s a <a href="http://www.computer.org/portal/cms_docs_itpro/itpro/content/promo1.pdf">link</a> to the editor&#8217;s introduction of the issue (free), where he says:</p>
<blockquote><p>&#8220;Stephen Buxton’s article on XML content servers describes the unique capabilities of this form of repository system and the extreme precision and information extraction that it can achieve.  The server’s content of unstructured text is richly tagged, usually by inflow entity extractors or taxonomies.  This provides a high degree of semantic quality and makes high relevancy search and disambiguation possible.  Search, as well as other applications, can be developed to sit atop the server and take full advantage of the metadata.  In this way, the enterprise can benefit from true information extraction in search as well as in other applications requiring high precision and a degree of semantic awareness.&#8221;</p></blockquote>
<p>In the article Buxton differentiates enterprise search engines from XML content servers as candidate platforms for content applications.</p>
<p>He also discusses several example content applications, including:</p>
<ul>
<li>The Oxford University Press African American Studies Center, an online product for social sciences libraries and researchers that does extensive content integration and repurposing</li>
</ul>
<ul>
<li>O&#8217;Reilly Media&#8217;s SafariU, a custom publishing system that enables professors to build custom books, online through a web interface with printed versions shipped to the campus bookstore in about 2 weeks</li>
</ul>
<ul>
<li>Elsevier&#8217;s PathConsult, a highly contextual application designed for pathologists in order to assist them in the tricky task of differential diagnosis.</li>
</ul>
<p>It&#8217;s worth the $19 &#8212; go ahead and <a href="https://newton.computer.org/DocDelivery/Shopping.nsf/AddToCart?OpenAgent&amp;U=http://csdl.computer.org/dl/mags/it/2007/01/f1029.pdf&amp;T=Beyond%20Search:%20Content%20Applications">get</a> the article.  Heck, it&#8217;s cheaper and faster to read than his book!</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/3932/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/3932/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/3932/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/3932/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/3932/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/3932/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/3932/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/3932/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/3932/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/3932/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/3932/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/3932/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/3932/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/3932/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/3932/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/3932/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3932&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/02/14/buxton-ieee-article-beyond-search-content-applications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>

		<media:content url="http://davidkellogg.files.wordpress.com/2010/09/img.jpg" medium="image">
			<media:title type="html">IT Professional Enterprise Search img</media:title>
		</media:content>
	</item>
		<item>
		<title>Rule 1 of Database Performance</title>
		<link>http://kellblog.com/2007/02/14/rule-1-of-database-performance/</link>
		<comments>http://kellblog.com/2007/02/14/rule-1-of-database-performance/#comments</comments>
		<pubDate>Wed, 14 Feb 2007 17:58:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[content applications]]></category>
		<category><![CDATA[Publishing 2.0]]></category>
		<category><![CDATA[Rule 1]]></category>
		<category><![CDATA[thick middle tier]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/02/14/rule-1-of-database-performance/</guid>
		<description><![CDATA[Here&#8217;s a link to a post done by Matt Turner on his Discovering XQuery blog that discusses Publishing 2.0 and content logic. In this post Matt discusses what I call the &#8220;thick middle tier&#8221; problem with most search-engine-based content applications. &#8230; <a href="http://kellblog.com/2007/02/14/rule-1-of-database-performance/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3931&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://xquery.typepad.com/photos/uncategorized/apptier2_1.jpg"><img style="float:right;width:275px;height:213px;margin:0 0 10px 10px;" src="http://xquery.typepad.com/photos/uncategorized/apptier2_1.jpg" alt="" border="0" /></a>Here&#8217;s a link to a <a href="http://xquery.typepad.com/xquery/2007/02/publishing_20_a.html">post</a> done by Matt Turner on his <a href="http://xquery.typepad.com/xquery/">Discovering XQuery</a> blog that discusses Publishing 2.0 and content logic.</p>
<p>In this post Matt discusses what I call the &#8220;thick middle tier&#8221; problem with most search-engine-based content applications.</p>
<p>Here&#8217;s the issue.  Search engines (1) return lists of links to documents and (2) allow only fairly basic &#8220;query&#8221; (and I&#8217;m reluctant to even call them that) predicates to be applied in the search engine.</p>
<p>As a result, a typical search-engine-based application ends up with a thick middle tier of Java code that (1) systematically materializes each document in the returned list as a <a href="http://en.wikipedia.org/wiki/Document_Object_Model">DOM</a> tree and then (2) does subsequent processing on that document using Java.</p>
<p>As Matt points out, you might be tempted to think of this work as &#8220;your application&#8221; or &#8220;business logic,&#8221; but in reality it&#8217;s not.  It&#8217;s content processing, not business or application processing.  This approach is bad for several reasons:
<ul>
<li>Productivity is negatively impacted because you have to do low-level content processing yourself, and typically in a relatively low-level language, like Java</li>
</ul>
<ul>
<li>Performance is negatively impacted because you end up with an architecture that violates <span style="font-weight:bold;">&#8220;rule 1&#8243; of database performance &#8212; push processing to the data, don&#8217;t bring data to the processing</span></li>
</ul>
<p>All DBMSs strive for compliance with rule 1.
<ul>
<li>Query optimizers always apply the most restrictive predicate first (e.g., apply emp-id = 178 before sex = female)</li>
</ul>
<ul>
<li>Query optimizers always do lookup joins from the table with the most restrictive predicates on it (where dept.dname = &#8220;fieldmkt&#8221; as opposed to emp.name = &#8220;*stein*&#8221;)</li>
</ul>
<ul>
<li>It&#8217;s why everyone loves stored procedures.  Not only do they minimize client/server interaction and allow pre-compilation, most importantly, they push processing to the data.</li>
</ul>
<p>I&#8217;m not going to criticize people who built systems this way historically.  Prior to products like <a href="http://www.marklogic.com/products">MarkLogic</a>, the thick-middle-tier architecture was the best you could do.  DBMSs couldn&#8217;t handle content so the best you could do was to leave your content in files (or stuff it in BLOBs), index it with a search engine, and then build these thick-middle-tier applications.</p>
<p>But in the future it doesn&#8217;t have to be this way.  With systems like MarkLogic, you can now build content applications using a standard query language (<a href="http://en.wikipedia.org/wiki/XQuery">XQuery</a>) and the &#8220;correct&#8221; allocation of processing across tiers.  This has the following benefits:</p>
<ul>
<li>Improved productivity because XQuery is a relatively high-level language</li>
<li>Greatly improved performance because you can thin-out the middle tier and push content  processing to the XML content server (which is both optimized to do it and close to the content)</li>
<li>Openness and standardization, which makes it easier to find skilled resources, eliminates vendor lock-in, and makes software integration generally easier.</li>
<li>Flexibility.  Typically with enough smarts in the middle layer you can hack something together than runs one query fast.  The trick is when you want to run many and/or new queries fast &#8212; in that case, you really need the right architecture &#8212; i.e., one that pushes processing to the content instead of bringing content to the processing.</li>
</ul>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/3931/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/3931/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/3931/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/3931/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/3931/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/3931/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/3931/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/3931/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/3931/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/3931/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/3931/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/3931/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/3931/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/3931/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/3931/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/3931/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&amp;blog=11070789&amp;post=3931&amp;subd=davidkellogg&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/02/14/rule-1-of-database-performance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>

		<media:content url="http://xquery.typepad.com/photos/uncategorized/apptier2_1.jpg" medium="image" />
	</item>
	</channel>
</rss>
