<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Kellblog &#187; special-purpose DBMS</title>
	<atom:link href="http://kellblog.com/category/special-purpose-dbms/feed/" rel="self" type="application/rss+xml" />
	<link>http://kellblog.com</link>
	<description>The official blog of Dave Kellogg</description>
	<lastBuildDate>Tue, 01 May 2012 01:07:36 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='kellblog.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/8ecfcffdb3cd0948a0c38207c0ca38d6?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Kellblog &#187; special-purpose DBMS</title>
		<link>http://kellblog.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://kellblog.com/osd.xml" title="Kellblog" />
	<atom:link rel='hub' href='http://kellblog.com/?pushpress=hub'/>
		<item>
		<title>Yes, Virginia, MarkLogic is a NoSQL System</title>
		<link>http://kellblog.com/2010/04/11/yes-virginia-marklogic-is-a-nosql-system/</link>
		<comments>http://kellblog.com/2010/04/11/yes-virginia-marklogic-is-a-nosql-system/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 03:07:32 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[database management systems]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[information infrastrutcure]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[Michael Stonebraker]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[Open source]]></category>
		<category><![CDATA[RDBMS]]></category>
		<category><![CDATA[relational database]]></category>
		<category><![CDATA[semi-structured data]]></category>
		<category><![CDATA[special-purpose DBMS]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[unstructured data]]></category>
		<category><![CDATA[XML]]></category>
		<category><![CDATA[XML database]]></category>
		<category><![CDATA[XQuery]]></category>

		<guid isPermaLink="false">http://www.kellblog.com/?p=4710</guid>
		<description><![CDATA[The other day I noticed a taxonomy used on one of the NoSQL Database blogs that went like this: Types of NoSQL systems Core NoSQL Systems Wide column stores Document stores Key-value / tuple stores Eventually consistent key-value stores Graph &#8230; <a href="http://kellblog.com/2010/04/11/yes-virginia-marklogic-is-a-nosql-system/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4710&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>The other day I noticed a taxonomy used on one of the <a href="http://nosql-database.org/">NoSQL Database</a> blogs that went like this:</p>
<blockquote><p><strong>Types of NoSQL systems</strong></p>
<ul>
<li>Core NoSQL Systems
<ul>
<li>Wide column stores</li>
<li>Document stores</li>
<li>Key-value / tuple stores</li>
<li>Eventually consistent key-value stores</li>
<li>Graph databases</li>
</ul>
</li>
</ul>
<ul>
<li>Soft NoSQL Systems (not the original intention &#8230;)
<ul>
<li>Object databases</li>
<li>Grid database solutions</li>
<li>XML databases</li>
<li>Other NoSQL-related databases</li>
</ul>
</li>
</ul>
</blockquote>
<p>I, perhaps obviously, take some umbrage at having <a href="http://www.marklogic.com/product/marklogic-server.html">MarkLogic</a> (acceptably classified as an XML database) being declared “soft NoSQL.”  In this post I’ll explain why.</p>
<p>Who decided that being open source was a requirement to be <strong>real </strong>NoSQL system?  More importantly, who gets to decide?  NoSQL – <a href="http://www.kellblog.com/2010/02/24/the-database-tea-party-the-nosql-movement/">like the Tea Party</a> – is a grass-roots, effectively leaderless movement towards relational database alternatives.  Anyone arguing original intent of the founders is misguided because there is no small group of clearly identified founders to ask.  In reality, all you can correctly argue is what you think was the intent of the initial NoSQL developers and early adopters, or &#8212; perhaps more customarily &#8212; why you were drawn to them yourself, disguised or confused as original founder intent.</p>
<p>As mentioned <a href="http://www.kellblog.com/2010/03/09/my-thoughts-on-the-nosql-database-tea-party-post/">here</a>, movements often appear homogeneous when they are indeed heterogeneous.  What looks like a long line of demonstrators protesting a single cause is in fact a rugby scrum of different groups pushing in only generally aligned directions.  For example, for each of the following potential motivations, I am certain that I can find some set of NoSQL advocates that are motivated by it:</p>
<ul>
<li>Anger at Oracle’s<a href="http://www.informationweek.com/news/global-cio/security/showArticle.jhtml?articleID=222002898"> heavy-handed licensing policies</a></li>
<li>The need to store unstructured or semi-structured data that doesn’t fit well into relations</li>
<li>The <a href="http://en.wikipedia.org/wiki/Object-relational_impedance_mismatch">impedance mismatch</a> with relational databases</li>
<li>A need and/or desire to use open source</li>
<li>An attempt to reduce total cost</li>
<li>A desire to land at a different point in the <a href="http://devblog.streamy.com/2009/08/24/cap-theorem/">Brewer CAP Theorem</a> triangle of consistency, availability, and partition tolerance</li>
<li>Coolness / wannabe-ism, as in, I want to be like Google or Facebook</li>
</ul>
<p>(Since this was a source of confusion in prior posts, note that this is <strong>not </strong>to claim the inverse:  that all NoSQL advocates are motivated by all of the possible motivations.)</p>
<p>I’d like to advocate a simple idea:  that NoSQL means NoSQL.  That a NoSQL system is defined as:</p>
<blockquote><p>A <a href="http://en.wikipedia.org/wiki/Structured_storage">structured storage</a> system that is not based on relational database technology and does not use SQL as its primary query language</p></blockquote>
<p>In short, my proposed definition means that NoSQL (broadly) = NoSQL (literally) + NoRelational.  In short:  relational database alternatives.  It does not mean:</p>
<ul>
<li>NoDBMS.  We should not take NoSQL to exclude systems we would traditionally define as DBMSs.  For example, supporting <a href="http://en.wikipedia.org/wiki/ACID">ACID</a> transactions or supporting a non-SQL query language (e.g., <a href="http://en.wikipedia.org/wiki/XQuery">XQuery</a>) should not be exclusion criteria for NoSQL.</li>
</ul>
<ul>
<li>NoCommercialSoftware.  While many of the flagship NoSQL projects (e.g., Hadoop, CouchDB) are open source projects, that should be not a defining criterion.  NoSQL should be a technological, not a delivery- or business-model, classification.  Technology and delivery model are orthogonal dimensions.   We should be able to speak of traditionally licensed, open source licensed, and cloud-hosted NoSQL systems if for no other reason than <a href="http://blogs.the451group.com/opensource/2010/03/25/please-break-our-open-source-business-strategy-model/">understanding the nuances of the various business/delivery models is a major task</a> unto itself.  Do you mean open source or open core?  Is it open source or faux-pen source?  Under which <a href="http://en.wikipedia.org/wiki/Open-source_license">open source license</a>?  How should I think of a hosted subscription service that is a based on or a derivative of an open source project?</li>
</ul>
<p>Recently, I’ve heard a piece of backpeddling that I’ve found rather irritating:  that NoSQL was never intended to mean “no SQL,” it was actually intended to mean “<a href="http://groups.google.com/group/nosql-discussion/browse_thread/thread/5da23890279c15f2">not only SQL</a>.”  Frankly, this strikes me as hogwash:  uh oh, I’m afraid that people are seeing us as disruptors and it’s probably easier to penetrate the enterprise as complementary, not competitive, so let’s turn what was a direct assault into a flanking attack.</p>
<p>To me, it’s simple:  NoSQL means NoSQL.  No SQL query language and no relational database management system.  Yes, it’s disruptive and &#8212; by some measures &#8212; “crazy talk” but no, we shouldn’t hide because there are lots of perfectly valid (and now socially acceptable) reasons to want to differ from the relational status quo.</p>
<p>In effect, my definition of NoSQL is relational database alternative.  Such options include both <strong>alternative databases</strong> (e.g., MarkLogic) and <strong>database alternatives</strong> (e.g., key/value stores).  This, of course, then cuts at your definition of database management system where I (for now at least) still require the support of a query language and the option to have ACID transactions.</p>
<p>By the way, I understand the desire to exclude various bandwagon-jumpers from the NoSQL cause.  Like most, I have no interest in including thrice-reborn object databases in the discussion, but if the cost of excluding them is excluding systems like MarkLogic then I think that cost is too high.  Many people contemplating the top-of-mind NoSQL systems (e.g., Hadoop) could be better served using MarkLogic which addresses many typical NoSQL concerns, including:</p>
<ul>
<li> Vast scale</li>
<li>High performance</li>
<li> Highly parallel shared-nothing clusters</li>
<li> Support for unstructured and semi-structured data</li>
</ul>
<p>All with all the pros (and cons) of being a commercial software package and without requiring reduced consistency:  losing a few Tweets won’t kill Twitter, but losing a few articles, records, or individuals might well kill a patient, bank, or counter-terrorism agency.  <a href="http://en.wikipedia.org/wiki/Eventual_consistency">BASE</a> is fine for some; many others still need ACID.  Michael Stonebraker has some further points on this idea in this <a href="http://cacm.acm.org/blogs/blog-cacm/83396-errors-in-database-systems-eventual-consistency-and-the-cap-theorem/fulltext">CACM post</a>.</p>
<p>I’d like to suggest that we should combine the ideas in this post with the ideas in my prior one, <a href="http://www.kellblog.com/2010/03/31/classifying-database-management-systems-regular-and-nosql/">Classifying Database Management Systems</a>.  That post says the correct way to classify DBMSs is by their native modeling element (e.g., table, class, hypercube).  This post says that NoSQL is semi-orthogonal – i.e., I can imagine a table-oriented database that doesn’t use SQL as its query language, but I doubt that any exist.  Applying my various rules, the combined posts say that:</p>
<ul>
<li> Aster is a SQL database optimized for analytics on big data</li>
<li> MarkLogic is an XML [document] database optimized for large quantities of semi-structured information and a NoSQL system</li>
<li> CouchDB is a document database and a NoSQL system</li>
<li> Reddis is a key/value store and a NoSQL system</li>
<li>VoltDB is a SQL database optimized to solve one of the two core problems that NoSQL systems are built for (i.e., <a href="http://www.dbms2.com/2010/03/13/the-naming-of-the-foo/">high-volume simple processing</a>)</li>
</ul>
<p>Finally, I’d conclude that even with these rules I have trouble classifying MarkLogic because of multiple inheritance:  <strong>MarkLogic is both a document database and an XML database</strong>, it is difficult to pick one over the other, and I there certainly are non-document-oriented XML database systems.   Similar issues exist with classifying the various hybrids of document databases and key/value stores.  So while I may have more work to do on building an overall taxonomy, I am absolutely sure about one thing:  MarkLogic is a NoSQL system.</p>
<p>&#8212;<br />
* The “Yes, Virginia” phrase comes from a 1897 story in the New York Sun.  For more, see <a href="http://en.wikipedia.org/wiki/Yes,_Virginia,_there_is_a_Santa_Claus">here</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4710/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4710/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4710/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4710&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2010/04/11/yes-virginia-marklogic-is-a-nosql-system/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>Stonebraker: Send Relational DBMSs to the Home for Tired Software</title>
		<link>http://kellblog.com/2009/07/02/stonebraker-send-relational-dbmss-to-the-home-for-tired-software/</link>
		<comments>http://kellblog.com/2009/07/02/stonebraker-send-relational-dbmss-to-the-home-for-tired-software/#comments</comments>
		<pubDate>Thu, 02 Jul 2009 14:30:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[relational database]]></category>
		<category><![CDATA[special-purpose DBMS]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2009/07/02/stonebraker-send-relational-dbmss-to-the-home-for-tired-software/</guid>
		<description><![CDATA[Mike Stonebraker spoke today at SIGMOD (see Tweetstream) where, among other things there was a 40-year anniversary celebration of the relational DBMS and, in what I suspect is non-coincidental timing, Mike did a post on the CACM site entitled The &#8230; <a href="http://kellblog.com/2009/07/02/stonebraker-send-relational-dbmss-to-the-home-for-tired-software/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4461&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Michael_Stonebraker">Mike Stonebraker</a> spoke today at <a href="http://www.sigmod.org/">SIGMOD</a> (see <a href="http://search.twitter.com/search?q=stonebraker">Tweetstream</a>) where, among other things there was a 40-year anniversary celebration of the relational DBMS and, in what I suspect is non-coincidental timing, Mike did a post on the <a href="http://cacm.acm.org/">CACM</a> site entitled <a href="http://cacm.acm.org/blogs/blog-cacm/32212-the-end-of-a-dbms-era-might-be-upon-us/fulltext">The End of a DBMS Era (Might be Upon Us)</a>.</p>
<p>Excerpt:<br />
<blockquote>Moreover, the code line from all of the major vendors is quite elderly, in all cases dating from the 1980s. Hence, the major vendors sell software that is a quarter century old, and has been extended and morphed to meet today’s needs. In my opinion, these legacy systems are at the end of their useful life. They deserve to be sent to the “home for tired software.”</p></blockquote>
<p>His key argument is all about performance:  in any given use-case, Stonebraker thinks RDBMSs can be beaten by about a factor of 50.
<ul>
<li>In data warehousing he says a <a href="http://en.wikipedia.org/wiki/Column-oriented_DBMS">column store</a> wins by 50x</li>
</ul>
<ul>
<li>In <a href="http://en.wikipedia.org/wiki/OLTP">OLTP</a> he says a memory-resident DBMS wins by 50x</li>
</ul>
<ul>
<li>For scientific data, he says <a href="http://cacm.acm.org/blogs/blog-cacm/22489-dbmss-for-science-applications-a-possible-solution/fulltext">a DBMS specialized for the job</a> can win by 50x</li>
</ul>
<ul>
<li>For <a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a>, he says column stores do a reasonable job and is confident that specialized <a href="http://esw.w3.org/topic/LargeTripleStores">RDF triple stores</a> will do better, i.e., 50x or more.  (I&#8217;d add that at MarkLogic we think we do a reasonable job as well.)</li>
</ul>
<ul>
<li>For text, he points out that no major search engine uses a relational database so they didn&#8217;t even qualify for consideration.</li>
</ul>
<ul>
<li>For XML, he cites a private report I sent him a while back done for one of our customers comparing MarkLogic performance to a relational DBMS.  When on &#8220;our turf,&#8221; we usually win by no less than 10x and sometimes 100x or more.  Sometimes, queries are not even processable in an RDBMS and/or need to be hand-optimized and hand-joined between a DBMS and a search engine.</li>
</ul>
<p>He reduces to three cases how special-purpose DBMS vendors get their advantage:
<ul>
<li>A non-relational data model</li>
<li>A different implementation of tables</li>
<li>A different implementation of transactions</li>
</ul>
<p>We&#8217;re in the first category, using XML as our data model instead of a table.  It&#8217;s a great post.  <a href="http://cacm.acm.org/blogs/blog-cacm/32212-the-end-of-a-dbms-era-might-be-upon-us/fulltext">Check it out</a> and check out the cited references as well.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4461/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4461/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4461/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4461/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4461/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4461/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4461/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4461/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4461/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4461/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4461/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4461/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4461/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4461/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4461&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2009/07/02/stonebraker-send-relational-dbmss-to-the-home-for-tired-software/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>The Demise of Closed-Source RDBMSs?</title>
		<link>http://kellblog.com/2007/12/18/the-demise-of-closed-source-rdbmss/</link>
		<comments>http://kellblog.com/2007/12/18/the-demise-of-closed-source-rdbmss/#comments</comments>
		<pubDate>Tue, 18 Dec 2007 16:48:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[relational database]]></category>
		<category><![CDATA[special-purpose DBMS]]></category>
		<category><![CDATA[strategy]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/12/18/the-demise-of-closed-source-rdbmss/</guid>
		<description><![CDATA[A friend pointed me to this interesting post by Allan Packer of Sun entitled Are Proprietary Databases Doomed? Overall, I think it&#8217;s a well done analysis of the DBMS market and well worth reading. First, a nit. When I was &#8230; <a href="http://kellblog.com/2007/12/18/the-demise-of-closed-source-rdbmss/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4104&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A friend pointed me to this interesting post by Allan Packer of Sun entitled <a href="http://blogs.sun.com/allanp/entry/are_proprietary_databases_doomed">Are Proprietary Databases Doomed?</a> Overall, I think it&#8217;s a well done analysis of the DBMS market and well worth reading.</p>
<p>First, a nit. When I was a lad, &#8220;proprietary&#8221; didn&#8217;t mean &#8220;<a href="http://simple.wikipedia.org/wiki/Closed_source">closed source</a>&#8220;, it meant proprietary (i.e., vendor controlled) interface. For example, Ingres originally spoke a query language called <a href="http://en.wikipedia.org/wiki/QUEL_query_languages">Quel</a>. SQL then emerged as the standard and any DBMS that spoke a language other than ANSI standard SQL was deemed proprietary. While I know that <a href="http://community.sigames.com/eve/forums/a/tpc/f/893197497/m/4152026602">some people in the open source community view the opposite of &#8220;open source&#8221; as &#8220;proprietary</a>,&#8221; I think that&#8217;s a misnomer. I think the correct antonym is closed source.</p>
<p>First, I think Allan makes an excellent point about stagnation:<br />
<blockquote>By the turn of the millenium, relational databases had already pretty much met the essential requirements of end users, and proprietary database companies were either pointing their vaccuum cleaners toward other interesting money piles, or losing the plot entirely and sailing off the edge of the world. Today, database releases continue to tout new features, but they&#8217;re frosting on the cake rather than essentials. No-one issues a tender for a database unless they have unusual requirements. No-one loses their job because they chose the wrong database. And it&#8217;s been that way for years. </p></blockquote>
<p>As a general rule I am shocked by the lack of innovation returned by the R&amp;D budgets of most technology companies. As I mentioned yesterday, despite billions of R&amp;D investment, Google has yet to come up with another big business. And what does Microsoft get for the billions they spend each year on R&amp;D? An incompatible version of Office with irritating &#8220;ribbons&#8221; that takes four years to make.</p>
<p>Silicon Valley startups create new categories with $10s of millions in venture capital. It seems that once they become &#8220;real companies&#8221; they forget how to innovate at all, let alone on a shoestring.</p>
<p>Specifically in the DBMS market, I think the lack of innovation &#8212; enabled by the oligopolistic structure of the market &#8212; creates a soft underbelly for focused, innovative companies to carve our niches. (And remember &#8220;niches&#8221; of $10B market can be pretty big.)</p>
<p>Allan goes on to do some interesting pricing analysis, and then poses the question:<br />
<blockquote>
<p>Why, then, is proprietary database software becoming more expensive while everything else reduces in price? End users normally expect to benefit from the cost savings resulting from improvements in technology. I am writing this blog, for example, on an affordable computer that would easily outperform expensive commercial systems from just 10 years ago. </p>
<p>It seems difficult to resist the conclusion that proprietary database companies have managed to redirect a good chunk of these savings away from end users and into their own coffers. Successful as this strategy has been, though, it could ultimately backfire. The more expensive proprietary databases become, the more attractive lower cost alternatives appear. </p>
</blockquote>
<p>I think the short answer to his question is (1) the market is an oligopoly and (2) there is a lot of inertia when it comes to database management systems. So change will happen, but it will happen slowly. And, ironically, the force that drives the market change will be overpricing on the leaders&#8217; part. Were RDBMSs not so expensive, there would be less impetus to move to open source.</p>
<p>Now, the RDBMS vendors probably argue they should &#8220;milk&#8221; the market until the real threat emerges and then &#8220;wave a wand&#8221; to reduce price, but that is a risky strategy because they could very easily wave the wand too late, which is what I think they are doing.</p>
<p>The only point I think Alan misses in his analysis is that some powerful vendors like SAP and EMC don&#8217;t like the fact that their applications run on top of lower-level DBMS technologies from competitors. For example, SAP has been trying to get itself off Oracle for about a decade, and I&#8217;m told they fund developers to work on MySQL towards that end. I know that EMC/Documentum is not comfortable that the vendors who provide the DBMSs they run on are all now challenging them in content management (e.g., Oracle/Stellent, IBM/FileNet, Microsoft SharePoint).</p>
<p>He then speculates on what he thinks will happen going forward:<br />
<blockquote>My vote for the Strategy Most Likely To Succeed is a tie between Revenue Pull-Through and Reduce Prices. Oracle is arguably becoming the most successful proponent of the pull-through strategy. Oracle wants to supply you with a full software stack, including an OS, virtualization software, a broad range of middleware, a database, and end user applications. The largest component of Oracle&#8217;s revenue currently still comes from database licenses, but the company is working hard to reduce that dependency. Until that happens, reducing prices across the board will be challenging for Oracle. If Oracle succeeds with a pull-through strategy, it doesn&#8217;t mean that OSDBs will fail, of course. It simply means that Oracle is less likely to sustain major damage from their success.</p></blockquote>
<p>He concludes:<br />
<blockquote>Are proprietary databases doomed, then? Not at all. Even if proprietary database companies pull no surprises, they won&#8217;t fade away anytime soon &#8230; Make no mistake, though, open source databases are coming. For established companies it&#8217;s more likely to be an evolution than a revolution. </p></blockquote>
<p>I believe there are two major trends in the DBMS market today: (1) open-source chipping away at the closed-source oligopoly, and (2) special-purpose DBMSs innovating and carving out niches in the soft underbelly. I actually think point 1 provides powerful &#8220;air cover&#8221; for vendors pursuing strategy 2, because point 1 is a direct attack on the existing business.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/4104/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/4104/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4104/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4104/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4104/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4104/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4104/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4104/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4104/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4104/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4104/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4104/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4104/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4104/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4104/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4104/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4104&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/12/18/the-demise-of-closed-source-rdbmss/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
		<item>
		<title>Wall Street Journal Validates Rise of Special-Purpose Databases</title>
		<link>http://kellblog.com/2007/11/20/wall-street-journal-validates-rise-of-special-purpose-databases/</link>
		<comments>http://kellblog.com/2007/11/20/wall-street-journal-validates-rise-of-special-purpose-databases/#comments</comments>
		<pubDate>Wed, 21 Nov 2007 00:51:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[special-purpose DBMS]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/11/20/wall-street-journal-validates-rise-of-special-purpose-databases/</guid>
		<description><![CDATA[The Wall Street Journal recently ran an article entitled Start-Ups Mine Database Field: Nimble Software Helps Make Sense of Information Tide (subscription required for full text) that validates the imminent mainstream-ing of the top meme I write about on this &#8230; <a href="http://kellblog.com/2007/11/20/wall-street-journal-validates-rise-of-special-purpose-databases/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4082&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="http://davidkellogg.files.wordpress.com/2010/09/dd.jpg"><img class="alignright size-full wp-image-6728" title="Data Deluge dd" src="http://davidkellogg.files.wordpress.com/2010/09/dd.jpg?w=500" alt=""   /></a>The Wall Street Journal recently ran an article entitled <a href="http://online.wsj.com/article/SB119491593552290689.html">Start-Ups Mine Database Field:  Nimble Software Helps Make Sense of Information Tide</a> (subscription required for full text) that validates the imminent mainstream-ing of the top meme I write about on this blog:  the rise of special-purpose database management systems.</p>
<p>The article begins with:</p>
<p class="times">
<blockquote>
<p class="times">Most databases are based on technology that originated 30 years  ago. But change is in the air.</p>
<p class="times">A mob of start-ups have been developing variants of the software,  which provides the equivalent of filing cabinets for corporate information.  Customers say the offerings are generating faster answers to questions that  require sifting through huge volumes of business information.</p>
<p class="times">Established suppliers aren&#8217;t conceding much to the newcomers, but  industry executives agree the pace of progress is accelerating.</p>
<p class="times">&#8220;The database market is going to be an exciting place to be in  the next decade,&#8221; said Michael Stonebraker, an adjunct professor at the  Massachusetts Institute of Technology &#8230;</p>
</blockquote>
<p>My favorite excerpt includes this quote from chief database guru at Gartner, <a href="http://www.gartner.com/AnalystBiography?authorId=490">Donald Feinberg</a> (whose actual title is vice president and distinguished analyst).</p>
<p class="times">
<blockquote><p>Some predict specialized products will find a niche. &#8220;One kind of  database is not going to suit all of the different applications we are coming up  with,&#8221; said Donald Feinberg, an analyst at market researcher Gartner Inc.</p></blockquote>
<p>I couldn&#8217;t say it any better myself.  And &#8212; whack  &#8212; think about this.  You have the head database analyst at Gartner saying that one kind of database isn&#8217;t going to meet all needs.  That&#8217;s big.  And he&#8217;s right.</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/4082/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/4082/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4082/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4082/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4082/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4082/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4082/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4082/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4082/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4082/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4082/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4082/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4082/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4082/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4082/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4082/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4082&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/11/20/wall-street-journal-validates-rise-of-special-purpose-databases/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>

		<media:content url="http://davidkellogg.files.wordpress.com/2010/09/dd.jpg" medium="image">
			<media:title type="html">Data Deluge dd</media:title>
		</media:content>
	</item>
		<item>
		<title>The End of an Architectural Era</title>
		<link>http://kellblog.com/2007/10/22/the-end-of-an-architectural-era/</link>
		<comments>http://kellblog.com/2007/10/22/the-end-of-an-architectural-era/#comments</comments>
		<pubDate>Mon, 22 Oct 2007 19:51:00 +0000</pubDate>
		<dc:creator>Dave Kellogg</dc:creator>
				<category><![CDATA[special-purpose DBMS]]></category>
		<category><![CDATA[Stonebraker]]></category>

		<guid isPermaLink="false">http://test.kellblog.com/2007/10/22/the-end-of-an-architectural-era/</guid>
		<description><![CDATA[I picked up via this post on the High Scalability Blog a new paper by Michael Stonebraker, Nabil Hachem, and Pat Helland entitled The End of an Era (It&#8217;s Time for a Complete Rewrite) presented at the VLDB 2007 conference &#8230; <a href="http://kellblog.com/2007/10/22/the-end-of-an-architectural-era/">Continue reading <span class="meta-nav">&#8594;</span></a><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4048&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I picked up <a href="http://www.highscalability.com/paper-end-architectural-era-it-s-time-complete-rewrite">via this post</a> on the <a href="http://www.highscalability.com/">High Scalability Blog</a> a new paper by <a href="http://en.wikipedia.org/wiki/Michael_Stonebraker">Michael Stonebraker</a>, Nabil Hachem, and Pat Helland entitled <a href="http://web.mit.edu/dna/www/vldb07hstore.pdf">The End of an Era (It&#8217;s Time for a Complete Rewrite)</a> presented at the <a href="http://www.vldb2007.org/">VLDB 2007</a> conference in Austria on September 23rd through 27th.</p>
<p>From the paper&#8217;s summary:<br />
<blockquote>&#8220;In the last quarter of a century, there has been a dramatic shift in:</p>
<p>1. DBMS markets: from business data processing to a collection of markets, with varying requirements<br />2. Necessary features: new requirements include shared nothing support and high availability<br />3. Technology: large main memories, the possibility of hot standbys, and the web change most everything</p>
<p>The result is:</p>
<p>1. The predicted demise of “one size fits all”<br />2. The inappropriateness of current relational implementations for any segment of the market<br />3. The necessity of rethinking both data models and query languages for the specialized engines, which we expect to be dominant in the various vertical markets&#8221;</p></blockquote>
<p>As you know, I&#8217;m a big believer in the special-purpose DBMS meme.  Any database historian knows what Codd was thinking, and more importantly &#8212; what he wasn&#8217;t &#8212; when he designed the relational model.  Again, from the paper:<br />
<blockquote>&#8220;Ted Codd’s idea of normalizing data into flat tables has served our community well over the subsequent 30 years.  However, there are now other markets, whose needs must be considered.  These include data warehouses, web-oriented search, real-time analytics, and semi-structured data markets.&#8221;</p></blockquote>
<p>The complete paper can be found <a href="http://web.mit.edu/dna/www/vldb07hstore.pdf">here.</a></p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/davidkellogg.wordpress.com/4048/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/davidkellogg.wordpress.com/4048/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/davidkellogg.wordpress.com/4048/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/davidkellogg.wordpress.com/4048/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/davidkellogg.wordpress.com/4048/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/davidkellogg.wordpress.com/4048/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/davidkellogg.wordpress.com/4048/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/davidkellogg.wordpress.com/4048/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/davidkellogg.wordpress.com/4048/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/davidkellogg.wordpress.com/4048/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/davidkellogg.wordpress.com/4048/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/davidkellogg.wordpress.com/4048/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/davidkellogg.wordpress.com/4048/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/davidkellogg.wordpress.com/4048/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/davidkellogg.wordpress.com/4048/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/davidkellogg.wordpress.com/4048/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=kellblog.com&#038;blog=11070789&#038;post=4048&#038;subd=davidkellogg&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://kellblog.com/2007/10/22/the-end-of-an-architectural-era/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Dave Kellogg</media:title>
		</media:content>
	</item>
	</channel>
</rss>
