Posts Tagged ‘Technology’

The James Bond of Datacenters

Conference room

It all started with a faint memory in the increasingly transient nature of the Internet. I vaguely remembered reading about a datacenter which was based somewhere near Stockholm and was built inside an (decommissioned) nuclear bunker. A little bit of googling later I realised that I had read about it on the Pingdom site. I thought to myself that since this is a conference where networking geeks converge, there is a good chance that some attendee knows the people in the Bahnhof ISP. So I sent this mail. Initially no one replied, but then the number of people interested in seeing the datacenter just ballooned. Two people stepped forward to arrange the visit during the lunch break at the conference and the CEO of Bahnhof, Jon Karlung personally gave guided us through this fantastic datacenter.

The entrance to the Datacenter

There is actually a house on top of that hill and there is pathway from the datacenter that opens up onto the top of the hill. The datacenter is actually dug out of hard rock (granite) as can be clearly seen from the pictures of the server floor at the end of the post. It was originally a military bunker and nuclear shelter during the Cold War era. The Code name from the military days Pionen White Mountains can be seen in the photos of the entrance to the datacenter.

Entrance to the White Mountain DC Entrance

The Backup Generators

2 Maybach MTU diesel engines that produce 1.5 Megawatt of AC power provide backup power. The engines were originally designed for German submarines. There is a warning horn from a German submarine that add to the effect :)

The Backup power room German Submarines Engine

The Conference Room

The Conference room and the pathway leading to it is made completely out of metal and glass and it hangs above the server flow adding tothe futuristic space staion look of the datacenter (as can be seen from the picture at the top of the post). There is also a Tintin theme rocket [ See Destination Moon for the Tintin Reference ]

The Tintin Rocket View from the bridge

The Fountains

There are lots of plants around the datacenter to reduce the claustrophobic feel of the bunker and make it like a more natural working environment. The Fountains at the entry also make up the decor of the place but are generally switched off as they make a lot of noise.

Another view of the Fountain The Fountain

The Netops and the Leisure room

This is Bahnhof’s biggest facility in Sweden and this Network operations room is used for running the ISP. The leisure room has a huge fishtank to add to natural feel of the place.

The LCDs in the NetOps room The NetOps room
The CEO and the DC manager The Leisure room

The Server Floor

Some of the walls of the server floor are unadroned and are made of bare rock giving away the initial use of the facility as a Cold war Era nuclear bunker.

Servers The Servers

Bahnhof uses the unique nature of the datacenter for marketing purposes. IT is actually possibe to co-locate your servers here. The Pionen Datacenter gives a whole new meaning to disaster recovery backup. :)

ACM Compute, Pycon India and Foss.in

With winter approaching fast we are also into conference season.

ACM Compute 2010 – 22nd & 23rd Jan, 2010.

ACM Bangalore chapter is organising ACM Compute 2010 which is into it’s third year now. This year the broad theme is Cloud computing and Information retrieval, management and analytics. The aim of this conference is to bring together researchers, practitioners, technology market movers, and thought leaders, with a view to advance the state of the art, and the state of the practice in applied research. This year we are planning to do something special – details soon :) The Call of papers (CFP) is out for sometime now and the last date for submissions is Oct 1 2009. You can also submit a proposal for a half-day or day long tutorial. Last year we had a bunch of good tutorials and also the symposium on Cloud Computing co-located with ACM Compute 2009 which was a great success.

Disclaimer: I am on the program committee of ACM Bangalore and am the secretary of the ACM Bangalore chapter.

Pycon India 2009 – 26th & 27th Sep 2009

Also this weekend (26th and 27th September 2009) India’s first Pycon India 2009 is being held in Bangalore. There is an interesting list of talks lined up. So do register if you are interested in attending.

Foss.in – December 1-5, 2009.

Foss.in shifts to a new venue this year NIMHANS convention centre. This year promises to be interesting as the venue is available for longer durations. Also there are going to be hacker evening/nights where tinkerers can meetup and talk about a whole range of stuff not restricted to just FOSS. My educated guess is something on the lines of CCC in Germany. Definitely something to look forward to. Plus I think there will be atleast one evening where we will have music :) . So join the mailing list if you are interested in presenting/attending as more details should emerge soon.

Supercrunchers – How Data Analysis is Changing our Lives

supercrunchers

I had been planning to read Supercrunchers by Ian Ayres for a while. The theme of the book is how you can educe information from raw data by applying various statistical techniques such as regression analysis and corelation algorithms. The author calls this process as “Supercrunching”.

The book starts with an introduction to supercrunching with the example of Orley Ashenfelter who devised a mathematical formula to determine the quality of wine (and hence it’s price). Wine connoisseurs initially dismissed the idea that a formula could beat their intuition and years of experience but the formula has stood the test of time and outperformed the experts in the field by a large margin. Another example is the loss of Kasparov to Deep Blue but the author fails to mention that Kasparov not only lost because of the huge amount of data that Deep Blue had at it’s disposal but also the team of scientists who were continually refining the algoritms to get to the right moves.

The first chapter ‘Whos doing the thinking for you ?’ dwells on the topic of recommendation engines such as the Netflix, Amazon, Last.fm and Pandora. (IEEE spectrum article on the winning algorithm by the current leaders of the Netflix prize). Recommendation engines and collaborative filtering has become almost as norm on the web today with almost every news site (most emailed, most read, most shared), shopping site (people who bought this also bought … ), media sites (recommendations for artists, songs, albums) and social networking sites (such and Linkedin and Facebook’s ‘People you may know’ feature) having such features built-in. A slightly different example in the book is that of Walmart which uses answers to certain questions to filter out (non-conformist) candidates for certain job profiles based on the data they have gathered from current employees. Internet search giant Google recently began crunching data from employee reviews and promotion and pay histories in a mathematical formula which it says can identify which of its 20,000 employees are most likely to quit. Fraud detection is another area which this chapter skims but you can write a book on just data mining and fraud detection.

The second chapter ‘Creating your own Data with a Flip of a Coin’ deals with testing hypotheses by running random trials to test the efficacy of a prediction especially in cases in which the cost of running the trial is low. Lots of companies are known to use this. For example, when Google or Yahoo change their front page or layout, different users see different versions of the page (which are slightly different from each other often in one single aspect) and the user behavior is then mapped and ranked based on a whole bunch of criteria. This goes on in several iterations till the design is perfected. This technique is called A/B testing. The efficacy of trail run can be improved by using the Taguchi methods. I found the author’s approach to using randomised trials to find answers similar to what Nassim Taleb’s advocacy of using ‘Monte Carlo Methods‘ in ‘Fooled by Randomness’ for random sampling.

The third chapter ‘Government by chance’ elucidates how randomised trial is helping change and improve social policies. Examples include Mexico’s Progresa initiative and MIT’s Poverty action Lab.

The chapter ‘How should physicians treat evidence based medicine ?’ shows how evidence-based medicine is profoundly changing medical practices. The most interesting story in the chapter is of Ignaz Semelweiss who used statistical techniques to figure out that chances of transmitting puerperal fever (a form of septicaemia) could be prevented by having doctors wash their hands on chlorinated solutions. Unfortunately he was not only ridiculed by many, admitted to an asylum and an in an ironic twist of fate died due to septicaemia only a fortnight later. All this before Louis Pasteur developed the germ theory of disease. The chapter also relates stories of how rule based system such as Isabel are helping avoid misdiagnosis of patients and assisting doctors to help treat patients faster.

The chapter ‘Experts Versus Equations’ expands on the underlying and subtle theme of the book – that supercrunching is slowly but surely defeating the experts in various fields (given accurate and adequate data) and how experts’ intuition is not always right. It goes on to explain how biases and emotions cloud our judgement and reduce the accuracy of our predictions. (A good aside at this point is reading up on Positive confirmation Bias – our tendency to search for data that confirms out perception and Cognitive dissonance – the uncomfortable feeling caused by holding two contradictory ideas simultaneously). The example given is how a very simplistic algorithm outperformed legal experts in determining how the supreme court justices of the US voted.

In the chapter ‘Why Now ?’, the author makes a compelling case of why supercrunching is becoming more and more relevant and accessible to ordinary people due to the rise in data crunching ability (Moore’s Law), falling cost of storage for large datasets (Kryder’s law) and ease of distribution and verification (The Internet). The author introduces neural network in this chapter and relates the story of the Epagogix – A firm that uses neural networks to improves the commercial gross of a film by tweaking movie scripts. Yes – unbelievable but who knew. It gives a whole new meaning to formulaic films. :)

In the chapter ‘Are we having fun yet ?’ the author touches upon the topic of education and how direct instruction is changing education in America. This was probably one of the most counterintuitive examples in the book. Direct Instruction replaces the discretion of the teacher with a behavioral script for teaching students. Teachers have resisted this method of teaching despite overwhelming evidence that this is the best way for teaching students. Again the conflict between intuition and hard numbers crops up in this chapter. Another theme in this chapter is how megacorps are using data (such as our buying decisions & browsing behavior) generated us to make decisions to increase their profitability. A recent NYTimes artcile touches upon this topic as well. (What Does Your Credit-Card Company Know About You?)

The Final Chapter ‘The Future of Intuition (and Expertise)’ explains some basic statistics in layman terms such as the 2SDrule, Bayes theorem and margin of error in statistical surveys. I wish this section were more extensive though.

Overall a good read (3/5), but I wish it had more examples and were more comprehensive. Ian Ayres also writes on the freakonomics blog. Also I recently read that IBM is working on software to analyze trends based on realtime data.

More @ Google Talk by Ian Ayers.

Cloud Camp Bangalore

CloudCamp Bangalore
The Clouds Above at Pangong Tso, Ladakh More photos here

The first CloudCamp in India is happening on Sunday, March 29th in Bangalore, India. ACM Bangalore is supporting this CloudCamp.

CloudCamp is an unconference where early adopters of Cloud Computing technologies exchange ideas. With the rapid change occurring in the industry, we need a place we can meet to share our experiences, challenges and solutions. At CloudCamp, you are encouraged you to share your thoughts in several open discussions, as we strive for the advancement of Cloud Computing. End users, IT professionals and vendors are all encouraged to participate.

One track will feature invited speakers from early adopter startups, CloudComputing Vendors and Developers. Register below.

CloudCamp Bangalore

If you are interested in sponsoring or submitting a proposal for a talk, send an email to vinayakh AT gmail DOT com.

Some of the proposed sessions are “A introduction to Cloud Computing” by Dave Nielsen, “How to build a Search Engine using AWS” by Chirayu Patel and “How to use Cloud computing to build a MMORPG” by Arjun Gupte.

Tentative Agenda:

Doors open at 10:00am
10:00am: Talks (2 startups on their experience of using clouds)
11:00am: Expert Talk (a survey of cloud platform products – Amazon, Sun, Google, etc)
11:30am: Expert talk (case studies of app architectures that use clouds)
12:15pm: Break for unconference
12:30pm: Unconference session 1
1:30pm: Lunch
2:30pm: Unconference session 2
3:30pm: Unconference session 3
4:30pm: Quick snacks
5:00pm – 6:00pm: Panel discussion and close

97 Things Every Software Architect Should Know

97 Things every sofware architect should know

I recently contributed to a book called ‘97 Things every sofware architect should know‘. It is a collection of axioms by various architects around the world including some famous names such as Allison Randal (The lead developer for Parrot), Bill de hÓra (Co-editor of Atom publishing protocol) , Michael Nygard (who wrote “Release It! Design and Deploy Production-Ready Software” – a 2008 Jolt Productivity Award Winner), Neal Ford (who wrote “The Productive Programmer”) and Rebecca Parsons (Thoughtworks CTO)

The book is now available for purchase from Amazon. The content of the book is licensed under Creative Commons Attribution 3 license. You can read it on the wiki if you are not inclined to buy the dead-tree version.

The book is edited by Richard Monson-Haefel and Mike Loukides. Interestingly the Amazon editorial review quotes the axiom that I have contributed (“For the End-user,the interface is the System”). Unfortunately there was a limit of 300 words for the axioms and no companies or products could be mentioned as the editors wanted a certain timeless quantity for the axioms (and rightly so). I plan to explain what I meant in greater detail in a blog post soon.

Some more interesting trivia about the book.
* Currently the book has 5 star rating from 3 reviewers.
* It is #1 in the “Design and Architecture” Category and #5 in The Software Development Category in a short time frame. (It’s been about 2 weeks since it was released.)

The cover has the photos of every author (whose contribution was accepted) on it. My photo is 2nd from the leftt on the last row :) .

Get Adobe Flash playerPlugin by wpburn.com wordpress themes