Thursday, December 5, 2013

Continuous integration - White paper from Zend

As programmers, it's always in our favor to learn from the experiences of others and maintaining quality work with planned efforts. I really find white paper from Zend useful in the context of continuous integration which is the core of agile software development approach and you can access it using following link:

https://drive.google.com/file/d/0ByBsxd4DRmCWdHhLbElFb3c4TFk/edit?usp=sharing

Feel free to share your experiences.

Enjoy programming!!!

Monday, May 27, 2013

The buzz word "Big Data"

"Big Data" has become really a buzz word for the professionals who are playing with data and want to know more about "Big Data" like:
  1. What the "Big Data" is?
  2. What's the use of "Big Data"?
  3. Analytics in the context of "Big Data"!!!
Below is the small discussion which I found really useful and will take you to the next level to explore "Big Data" :
  • When you were reading a book in the past, no external data was generated. If you now use a Kindle or Nook device, they track what you are reading, when you are reading it, how often you read it, how quickly you read it, and so on.
  • When you were listening to CDs in the past no data was generated. Now we listen to Music on your iPhone or digital music player and these devices are recording data on what we are listening to, when and how often, in what order etc.
  • Today, most of us carry smart phones and they are constantly collecting and generating data by logging our location, tracking our speed, monitoring what apps we are using as well as who we are ringing or texting.
  • Sensors are increasingly used to monitor and capture everything from temperature to power consumption, from ocean movements to traffic flows, from dust bin collections to your heart rate. Your car is full of sensors and so are smart TVs, smart watches, smart fridges, etc.
Finally, combine all this now with the billions of internet searches performed daily, the billions of status updates, wall posts, comments and likes generated on Facebook each day, the 400+ million tweets sent on Twitter per day and the 72 hours of video uploaded to YouTube every minute. I am sure you are getting the point. The volume of data is growing at a freighting rate. Google's executive chairman Eric Schmidt brings it to a point: "From the dawn of civilization until 2003, humankind generated five exabytes of data. Now we produce five exabytes every two days…and the pace is accelerating."

Not only do we have a lot of data, we also have a lot of different and new types of data: text, video, web search logs, sensor data, financial transactions and credit card payments etc. In the world of 'Big Data' we talk about the 4 Vs that characterize big data:
  1. Volume – the vast amounts of data generated every second
  2. Velocity – the speed at which new data is generated and moves around (credit card fraud detection is a good example where millions of transactions are checked for unusual patterns in almost real time)
  3. Variety – the increasingly different types of data (from financial data to social media feeds, from photos to sensor data, from video capture to voice recordings)
  4. Veracity – the messiness of the data (just think of Twitter posts with hash tags, abbreviations, typos and colloquial speech) So, we have a lot of data, in different formats, that is often fast moving and of varying quality – why would that change the world? The reason the world will change is that we now have the technology to bring all of this data together and analyze it.
In the past we had traditional database and analytics tools that couldn't deal with extremely large, messy, unstructured and fast moving data. Without going into too much detail, we now have software like Hadoop and others which enable us to analyze large, messy and fast moving volumes of structured and unstructured data. It does it by breaking the task up between many different computers (which is a bit like how Google breaks up the computation of its search function). As a consequence of this, companies can now bring together these different and previously inaccessible data sources to generate impressive results. Let's look at some real examples of how big data is used today to make a difference:
  • The FBI is combining data from social media, CCTV cameras, phone calls and texts to track down criminals and predict the next terrorist attack.
  • Facebook is using face recognition tools to compare the photos you have up-loaded with those of others to find potential friends of yours.
  • Politicians are using social media analytics to determine where they have to campaign the hardest to win the next election.
  • Video analytics and sensor data of Baseball or Football games is used to improve performance of players and teams. For example, you can now buy a baseball with over 200 sensors in it that will give you detailed feedback on how to improve your game.
  • Artists like Lady Gaga are using data of our listening preferences and sequences to determine the most popular playlist for her live gigs.
  • Google's self-driving car is analyzing a gigantic amount of data from sensor and cameras in real time to stay on the road safely.
  • The GPS information on where our phone is and how fast it is moving is now used to provide live traffic up-dates.
  • Companies are using sentiment analysis of Facebook and Twitter posts to determine and predict sales volume and brand equity.
  • Supermarkets are combining their loyalty card data with social media information to detect and leverage changing buying patterns.
  • A hospital unit that looks after premature and sick babies is generating a live steam of every heartbeat. It then analyses the data to identify patterns. Based on the analysis the system can now detect infections 24hrs before the baby would show any visible symptoms, which allows early intervention and treatment.
And these examples are just the beginning.

Companies are barely starting to get to grips with the new world of big data. In conclusion then, big data will change the world. In terms of language I prefer to talk about the 'datafication of the world' in relation to the ever-growing amounts of data and 'large-scale analytics' (or simply 'analytics' because what is large now will be normal tomorrow) in relation to our ability to analyze and harness big data.

Reference: http://www.linkedin.com/today/post/article/20130527063838-64875646-what-the-hell-is-big-data

Enjoy Programming!!!