NoSQL Now! 2013 Conference

This week I am in San Jose, California for the 2013 edition of the NoSQL Now! conference. This is the third annual NoSQL Now! Conference, which bills itself as the largest vendor-neutral forum focused on NoSQL technologies.

But what is NoSQL? I am not an advocate of naming anything based on what it is not, and that is exactly what NoSQL does… or perhaps I should say did. NoSQL used to be two words munged together describing the technology, that is No + SQL = NoSQL. But these days the NoSQL crowd bills itself as Not Only SQL. That is wise because technologies like Hive bring SQL-like functionality to NoSQL database systems. Once you try to use NoSQL databases without SQL, instead coding MapReduce logic in Java to access data for example, you will be thrilled for the simplicity of SQL-like access to your data.

But I digress. The conference offers a great way to learn about the various types of NoSQL database systems (document stores, graph databases, key/value data stores, columnar databases, in-memory data grids, streaming solutions, and so on). The educational sessions on Tuesday, the first day of the conference, were half-day sessions that focused on in-depth dives into technologies like Hadoop and graph databases.

The first session I attended provided a wealth of information on Hadoop and was worth my attending the conference. The session, Introduction to the Hadoop Ecosystem, by Dr. Vladimir Bacvanski (of SciSpike) offered a fantastic overview of Hadoop and related technologies. Some of the relevant takeaways from the session included:

  • The mindset of Hadoop is that storage of data is very cheap… and processing is cheap, too.
  • The idea is to move the job to the data, not the data to the job… or stated another way, “Don’t move petabytes to the program, but the program to the petabytes”
  • When you deal with Hadoop you generally store the data in its original format instead of breaking it apart or filtering it.

Of course, Dr. Bacvanski described MapReduce (the algorithm), Hadoop (the implementation of the algorithm), and the general environment (multiple, redundant nodes for processing large amounts of data). MapReduce is basically two steps:

  1. the “Map” step where input is split into pieces with nodes processing these pieces in parallel and stores the results of the “query.”
  2. the “Reduce” step where the mapped data is aggregated by worker nodes under the control of the Job Tracker

He also discussed other aspects of the Hadoop ecosystem including HDFS, Hive, Pig, HBase, HCatalog, Zookeeper, Sqoop, Flume, and more. The speaker covered some of the more common Hadoop systems like Cloudera, Hortonworks, and IBM BigInsights (among others), and closed with a nice introduction to stream computing.

The second half-day session I attended was interesting, but a little more disjointed and difficult to follow than the first. This session was titled NoSQL Database Patterns and was delivered by Srini Penchikala of InfoQ. The session was more application development oriented and covered the architecture and design pattern of NoSQL database systems.

Penchikala discussed popular application development frameworks like Spring Data, Spring Integration and others, showing how to implement applications with polyglot persistence. What is polyglot persistence, you ask? Well, it is basically a fancy (and confusing) term for storing data in multiple database systems while accessing them, as needed, via a framework that masks their differences.

The highlight of this session was the several demos that the presenter walked through using the Spring Data framework against different NoSQL data stores.

Finally, the day ended with a series of 5 minute lightning talks on a variety of topics, including quantum computing, data quality, and FoundationDB – a new NoSQL database systems announced for general availability today (August 20, 2013) that combines NoSQL and ACID. Better yet, FoundationDB is offering a free community license for the new offering.

I’m looking forward to day two of this conference, which is more along the lines of a traditional conference with multiple 60 minute sessions throughout the day… and the opening of the exhibition area where a bevy of NoSQL vendors will promote and demo their offerings.

Be sure to tune in tomorrow for my coverage of Day Two of NoSQL! Now…

About craig@craigsmullins.com

I'm a data management strategist, researcher, and consultant with over three decades of experience in all facets of database systems development and implementation.
This entry was posted in Big Data, NoSQL. Bookmark the permalink.

9 Responses to NoSQL Now! 2013 Conference

  1. Pingback: DB2 Hub | NoSQL Now! 2013 Conference

  2. Pingback: NoSQL Gets Me Thinking About ACID - IBM DB2 - IBM DB2 - Toad World

  3. Informative stuff, Big data is the future…

  4. get smart says:

    NoSQL defines a broad class of database management systems. One would expect the class defined by the term “No SQL” to exclude any database that makes use of SQL style relational databases. However, the term has evolved and now stands for “Not Only SQL” and covers a wide range of technologies. NoSQL represents a movement or school of thought away from the relational database model and toward distributed, object oriented, document based storage. Selecting the correct NoSQL database from this emerging group of technologies presents a difficult challenge. With 27 NoSQL Databases to choose from, we can help you find the best one for your needs.

  5. As its name implies, NoSQL is not an SQL database but rather a shell-level tool, as explained in Philosophy of NoSQL . NoSQL data is contained in regular UNIX ASCII files, and so can be manipulated by regular UNIX utilities, e.g. ls, wc, mv, cp, cat, head, more, less, editors like ‘vi,’ etc., as well as by powerful versioning systems, such as RCS and CVS.

  6. www.dmzforum.org says:

    What’s up, just wanted to tell you, I enjoyed this
    article. It was funny. Keep on posting!

  7. SQIAR BI says:

    its give a lots of information

  8. click here says:

    Thanks for writing about the NoSQL Now! 2013 Conference | Data and Technology Today — Loved it!

Leave a comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.