All posts by Martin MC Brown

a.k.a.: Martin MC Brown a.k.a.: Martin Brown a.k.a.: mcslp a.k.a.: Martin C Brown a.k.a.: MC

Document Databases in Predictive Modeling

My latest article on performing predictive modeling using document databases is now available on IBM developerWorks. The abstract:

Predictive analytics relies on processing, analyzing data from many different sources, collating, and then processing that through several stages into usable data. This involves recording and storing data in different formats, and may require translating information into PMML. Despite the complexities and structure of the information, and the sources often involving data from traditional RDBMS data sources, other solutions offer some advantages. We can use the recent range of document-based NoSQL databases to help collate the information in a structured format, while coping with the flexible structure of the individual data points. Many NoSQL environments also provide support for extensive map reduce type queries and processing that makes them ideal for processing large volumes of data into a summary format. In this article, we’ll look at the transfer, exchange, and formatting of information in NoSQL environments.

Read Document databases in predictive modeling

 

Using Hadoop and Couchbase

My new article on using Hadoop with Couchbase is available now on the IBM developerWorks site. 

The article tells you how to integrate the massive map/reduce functionality offered by Hadoop with the query functionality offered in Couchbase.                                                                                                                            

With this article you also get a live demo of the process in action, and an intro video for the problems at hand we are trying to solve: 

Read: Using Hadoop with Couchbase

Fortunately the article was also chosen as a feature article for the entire developerWorks site, and came with call picture of an elephant sitting on a couch!


Using Hadoop and Couchbase

I've just had a new article published on IBM Developerworks that looks at how to use Hadoop and Couchbase together. This enables you to take advantage of the huge map/reduce functionality on very large datasets within Hadoop and combine that with the querying and indexing functionality in Couchbase Server 2.0 to enable you to pull out specific details. 

This effectively adds querying to Hadoop using the familiar map/reduce structure for both your big data processing at the front end, and your redcued data set once the information is stored into Couchbase. 

You can read the full article: https://www.ibm.com/developerworks/opensource/library/ba-hadoop-couchbase/

Permalink | Leave a comment  »

Getting Started with Couchbase Server Published!

Extreme Scalability at Your FingertipsHaving not published a book in years, I’ve just published my second book in as many months. 

Getting Started with Couchbase Server provides an overview of the main administration and user information that you need to install, setup, manage, and develop against Couchbase Server. 

In the book, I’ve tried to cover the installation, and then the main admin tasks, such as rebalancing and expanding your cluster, along with backups and restore. 

Also in the book is some basic information on developing an application, including using the main operations and how you store and retreieve information in the cluster. 

The book is on Couchbase Server 1.8, so no views, but there should be enough for you get going. 

Get more information here.

The Technology Behind Couchbase

Couchbase Server is one product, but it’s made up of a combination of different components that work together in order to produce server product, including memcached and the spidermonkey JavaScript engine, and more recently the CouchDB engine for storage and index creation. 

An article covering more of the detail is available here: 

http://blog.safaribooksonline.com/2012/03/01/the-technology-behind-couchbase/

Getting Started with CouchDB published!

I've just published my first book for a few years. The book is Getting Started with CouchDB with O'Reilly.

The book is a quick guide to everything you need to know to start working with and building applications with CouchDB. 

Some highlights of the contents:

  • Install CouchDB on Linux, Mac OS X, Windows, or (if you must) from the source code
  • Interact with data through CouchDB’s RESTful API, and use standard HTTP operations, such as PUT, GET, POST, and DELETE
  • Use Futon—CouchDB’s web-based interface— to manage databases and documents, and to configure replications
  • Learn how to create, update, and delete documents in JSON format, and how to create and delete databases
  • Work with design documents to get the formatting and indexing your application requires

 

Permalink | Leave a comment  »