12/2/12

Experiences with Big Data, NoSQL, Integration with Conventional Systems and more… Interview at IOD 2012


While at IBM’s Information on Demand 2012 Conference, between the talks and client meetings, I was interviewed by IBM’s developerWorks channel. You can see the interview here:



11/5/12

Developing High Performance Database Applications with pureQuery and IBM Data Studio


NoSQL data stores are in focus when it comes to high performance data applications. However, for many organizations relational databases are the mainstay of infrastructure, and we need to get them executing efficiently as well. In recent years, the enthusiasm around various object-relational mapping tools has been partly replaced with healthy skepticism. Explicit control over SQL has become an important feature, especially when we are dealing with established databases that were not created from scratch by the developers.


When working in a DB2 environment, pureQuery is a particularly attractive solution because it combines explicit SQL control, high performance, and excellent tooling. This year I will be speaking on pureQuery at the IBM Information on Demand 2012 conference in Las Vegas.

Update: The slides of the talk are now on SlideShare, but the really interesting part of the talk, not captured in the slides, is the hands-on demo, where we build a DB2-backed application in a matter of minutes and then profile and optimize its SQL performance.





4/16/12

Big Data for Managers: A Tutorial at EDW 2012

The Enterprise Data World conference introduces some exciting new topics this year. My favorites: Big Data and NoSQL. If you are a manager and have not yet had the opportunity to explore Big Data and NoSQL, you may find the tutorial I will be teaching in Atlanta on April 29th useful.



Big Data Technologies: A Manager's Guide

Big Data is the hottest topic in IT right now, but do you really know what it means to your organization? How does it impact your job? And when the CIO asks you whether to use Hadoop on an upcoming project, do you have a well-informed reply? This workshop will describe the Big Data landscape in terms that managers and architects can understand and translate for your organization. How is Big Data different from what you know now, and how is it the same? Why is Big Data taking the IT world by storm? What are the business and technological motives behind it? You’ll get a high-level view of the technologies involved, some of the prominent tools and languages (such as Hadoop and R), and by the end of the class will have a sufficient taxonomy that you can communicate with senior executives and developers.
  • How big is Big Data? 
  • Use cases and business drivers – technology costs and new analytical power 
  • Hardware and software components 
  • Key Big Data technologies and the functions they perform: 
      • Hadoop - MapReduce - HBase - Hive - Pig
      • NoSQL databases and data stores, such as MongoDB, Cassandra and Couchbase
      • R – the language of Big Data analytics
  • Incorporating new information streams from social networks, sensors and unstructured data sources 
  • Integrating existing enterprise systems with Big Data
Check out the conference agenda and the tutorial here.

4/8/12

Win Friends and Influence People ...with DSLs




Using Domain Specific Languages to improve speed and quality in software development is one of the areas that I pursue. In this talk at EclipseCon 2012, my co-speakers and I share our experiences and lessons learned from using DSLs on our joint project. We at SciSpike have been building DSL-based solutions for a long time now, using various tools. Lately, our favorite toolkit is based on the Eclipse Modeling Framework (EMF) and Xtext. One of the interesting aspects of this presentation is that it does not focus solely on technology, but also addresses the impact of DSL development on the process.

We also discuss the impact of DSLs on development teams and their skills, along with some obstacles we have encountered in the process. We conclude by discussing "Seven Habits of Successful DSL Projects".

Check out the slides here:

1/9/12

Introducing Hadoop and Big Data into a Traditional Data Organization: A True Story and Learned Lessons


As some of you may know, one part of my consulting and mentoring work revolves around data, particularly around Big Data and NoSQL systems. Together with a client of mine, we are building one such system for applications in healthcare. We are very excited that we were invited to speak at the Enterprise Data World 2012 conference in Atlanta. How could we refuse an opportunity to show off our work? Care to come to Atlanta and join us? For now, as a teaser, here is what we are going to talk about.
A real-life story about introducing and integrating Hadoop, Big Data and NoSQL into an organization to reduce cost and speed up data processing. Come and learn how to make it work!

In this talk we will share the story of our journey into Hadoop, Big Data, MapReduce, and NoSQL, undertaken with the goal of reducing cost and improving the speed of data processing. We start with Hadoop and its MapReduce model, which splits processing across many commodity machines. We found this to be an effective solution, albeit not without warts. From our experiences, you will learn how to introduce Hadoop effectively into a conventional data processing organization and how to integrate it not only with conventional data processing technologies, but also with people. As our appetite grew, we had to reach towards NoSQL storage. You will also learn how to migrate from a local deployment to the cloud. We conclude with "7 Habits of Successful Hadoop Projects".
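
For readers who have not seen the model before, here is a minimal single-process sketch in Python of the map, shuffle, and reduce steps. It is not the code from our project and it does not use Hadoop's API; the function names and the word-count task are assumptions chosen purely for illustration. On a real cluster each input split would be mapped on a different machine.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(splits):
    """Run map, shuffle, and reduce over a list of input splits.

    Each split is a list of lines; here the splits are processed
    one after another in a single process (illustrative only).
    """
    mapped = chain.from_iterable(
        map_phase(line) for split in splits for line in split
    )
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())
```

Calling `word_count([["big data big"], ["data lake"]])` treats the two inner lists as two splits and returns one count per distinct word.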

Topics
•    Hadoop and Big Data: why do we care?
•    Avoiding friction and integrating with relational databases
•    Preventing shock: the people issue
•    The buzz of Hive
•    When Hadoop alone is not enough
•    Next: into the Cloud!
•    7 Habits of Successful Hadoop Projects

I'll publish the slides when they become available after the conference. And now, excuse us, we have to attend to our Hadoop cluster ;-)