ScaleCamp

Tuesday, June 9, 2009 from 7:00 PM to 9:00 PM (PT)

Santa Clara, CA

Event Details

Scale Unlimited and Aster Data Systems present:

ScaleCamp at 7 pm on June 9, 2009 at the Santa Clara Marriott in Santa Clara, CA. It is the night before the Hadoop Summit, at the same location.

This is a community event. We have speakers from Scale Unlimited, Aster, and more.

Our focus will be experience reports from projects that use technology from the Hadoop ecosystem, Aster, Katta, Cascading, Voldemort, etc.

There are three parallel tracks, after which we socialize and have general discussions.

Space is limited, so please register early for this free event.

This event is open for everyone to speak. Please contact us here if you have an interesting use case to share.

For more information about ScaleCamp, visit the Scale Unlimited information page.


Here is the live Speakers List, updated daily.

Ted Dunning - DeepDyve
Katta was developed as a shard-distributed form of Lucene in which the use of ZooKeeper made shard management simple. At DeepDyve, we have re-imagined Katta as a completely generic shard manager for anything that needs reliable, scalable, distributed and aggregated operations against shards.

Stefan Groschupf - Scale Unlimited
Extracting information from the Web becomes a challenge when it needs to scale. At EMI Music, a stack of Hadoop, Cascading and the crawler toolkit Bixo is used to fetch and process a large set of websites that feeds into a BI system.

David Fallside - IBM
We have created a number of proof-of-concept projects with media and financial services companies that use Hadoop, Nutch and related technologies. A notable feature of these projects is integration, both in the sense of data obtained from different sources, and in the sense of integration of Hadoop and visualization technologies. I intend to talk about two projects, one for a media company and one for a financial services company, describe the applications we built around Hadoop for each, and show some of their output visualizations. I’ll include some speeds and feeds as well.

Paul Baclace
Visualizing Map-Reduce: A demo of interactive time-space diagrams that illustrate the performance of Hadoop Map-Reduce jobs.

Peter Pawlowski - Aster Data Systems
MapReduce Inside a Database System - When and How. Peter will discuss Aster Data's in-database MapReduce technology and present use cases where it complements other technologies like Hadoop.

Dr. DJ Patil - LinkedIn
Large-scale Analytics at LinkedIn: Dr. Patil will discuss LinkedIn's current analytics framework and the methodology and technologies they are planning to put in place for future growth.

Alex Dorman - ContextWeb
Alex will share ContextWeb's experience of using Hadoop for frequent aggregation of data, ad performance optimizations, report generation and analytics.

Paco Nathan - ShareThis
Paco will discuss how ShareThis mashes technologies in the Cloud for Big Data analysis, leveraging the Aster nCluster Cloud Edition, Amazon Elastic MapReduce, Cascading and other AWS services in their analytic system architecture.

Matt Ingenthron - NorthScale
Memory Caching to Scale: is it Different on a Public Cloud?
Many developers have grown to rely upon a distributed memory cache integrated into application architectures to deliver interactive, responsive sites. When bringing these applications to cloud compute environments, there can be challenges in throughput/latency and in getting the desired elasticity out of the environment. Matt will review NorthScale's experience with distributed memory caches in public clouds and approaches to preserving the needed performance and scalability from the application's point of view.

Kevin Beyer - IBM
Kevin will present "Advanced data flow analytics in Jaql"

Jean-Daniel Cryans - Openplaces.org; Ecole De Technologie Superieure, Montreal
Jean-Daniel will talk about Hadoop at openplaces.org, a project to build the world's largest organized repository of maps, pictures, details, and advice about every place in the world. It runs right off an EC2-hosted HBase cluster, and internally a 40-node cluster is used to batch process crawl data, indexing, named entity recognition, and other data mining tasks, with more than 50 MapReduce jobs. The presentation is about the experience of using Hadoop and HBase in such an environment, with advice for those who would consider going down the same path.

Rusty Burchfield and Doug Judd - Zvents
Rusty Burchfield and Doug Judd will co-present. Rusty will start by describing how the Zvents team uses scalable computing technology for analytics and reporting. They'll give a brief demo of some of the applications built using Hadoop, Hypertable and Cascading. Doug will finish with a brief overview of Hypertable and its current status.