It has been a while since I actively blogged on this personal site of ours. It has been a busy couple of years, and our teams have pushed the boundaries of pretty much every technology out there that deals with data and analytics.
Some 4-5 years ago we started an internal project and, inspired by Ray Kurzweil's The Singularity Is Near, dubbed it Singularity.
We are only weeks away from launching V3 of our Singularity platform, and it's nothing short of amazing. We set out to scale big and economically, to make the complex easy, and to put the seemingly impossible in the hands of all our analysts, without special training or knowledge of complex programming languages. That means putting hundreds of trillions of behavioral patterns to use, structuring complex data just enough to make it simple to use while keeping loosely structured patterns the way they are, and storing unstructured data as is, projecting logic and structure onto it at runtime.
Written by Oliver Ratzesberger
Monday, 21 April 2008 00:00
Turning utility computing into a service model for analytics.
With the needs of enterprise analytics growing at ever-increasing speed, it has become clear that traditional hub-and-spoke architectures cannot sustain the demands of increasingly complex business analytics. As with any proliferation of systems, the overhead of managing, maintaining and developing trees of increasingly complex dependencies quickly outpaces an organization's ability to deal with its challenges. What may work well at first turns into a real evolution nightmare.
Last Updated on Saturday, 12 February 2011 08:02
xlmpp is a multi-author blog about the latest trends in extremely large-scale massively parallel processing (MPP). This site is not about products or vendors but about approaches, architecture, algorithms, the how, the what and, most importantly, what to avoid and what not to do. Extremely large data volumes present unique challenges. Processing hundreds to thousands of billions of records, rows or lines of text, whether inside a database or not, requires not only massively parallel systems but also a great deal of attention to detail.
To provide a little background on the types of systems we work on, we felt it would be beneficial to share some high-level stats about our infrastructure. Incoming data volumes exceed 50TB per day, with more than 10^11 new items/lines/records added per day. Our analytical processing infrastructure exceeds 12PB of physical storage, with over 4.5PB in our largest cluster. We leverage compression technologies wherever possible and achieve compression ratios as high as 96% on our highest-volume data feeds.
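
To put those figures in perspective, here is a rough back-of-envelope sketch. It assumes decimal units (1TB = 10^12 bytes) and reads "96% compression" as a 96% reduction in size; neither is stated in the post, and the function names are purely illustrative. Since the published numbers are lower bounds ("exceed") and a best case ("as high as"), the results are only indicative.

```python
# Back-of-envelope arithmetic on the published figures.
# Assumptions (not from the post): decimal terabytes, and
# "96% compression" meaning a 96% reduction in stored size.

TB = 10**12  # bytes in a decimal terabyte (assumed unit)


def avg_bytes_per_record(daily_bytes, daily_records):
    """Average raw size of one incoming record, in bytes."""
    return daily_bytes / daily_records


def compressed_size(raw_bytes, savings_pct):
    """Size after compression, given the space savings in whole percent."""
    return raw_bytes * (100 - savings_pct) // 100


daily_bytes = 50 * TB      # "exceed 50TB per day"
daily_records = 10**11     # "more than 10^11 new items/lines/records"

print(avg_bytes_per_record(daily_bytes, daily_records))  # 500.0 bytes per record
print(compressed_size(daily_bytes, 96) / TB)             # 2.0 TB per day at best-case compression
```

Under these assumptions, an average record is on the order of 500 bytes, and a feed compressed at the best-case ratio would occupy roughly one twenty-fifth of its raw size.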