Mike Chelen
Fwd: CDH2: Cloudera’s Distribution for Hadoop 2 - http://www.cloudera.com/blog... (via http://friendfeed.com/mndoci...)
Amund Tveit
Case Study: Using Hadoop to help people write - http://aws.amazon.com/solutio...
Mike Chelen
Fwd: Analyzing Human Genomes with Hadoop » Cloudera Hadoop & Big Data Blog - http://www.cloudera.com/blog... (via http://friendfeed.com/the-lif...)
Deepak Singh
Clojure+Hadoop Slides - Digital Digressions by Stuart Sierra - http://stuartsierra.com/2009...
Stuart Sierra's Clojure + Hadoop slides - Deepak Singh from Bookmarklet
hadi
I am a beginner with Hadoop. Which Linux distribution is better for Hadoop clients?
Mike Chelen
"Cascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster." - Mike Chelen from Bookmarklet
jeff hammerbacher
Hadoop World NYC is Oct 2: http://www.cloudera.com/hadoop-.... Learn how enterprises like JP Morgan, Visa, eBay, IBM, Booz Allen, and more are using Hadoop.
Mike Chelen
Hadoop computes the 10^15+1st bit of π (Hadoop and Distributed Computing at Yahoo!) - http://developer.yahoo.net/blogs...
"I used Yahoo's Hadoop clusters to compute the 1,000,000,000,000,001st bit of π. The 7 hexadecimal digits of π starting at the 10^15+1 bit are: 6216B06" - Mike Chelen from Bookmarklet
Mike Chelen
"The Hadoop project is extremely important to us here at Yahoo!. We run the world's largest Hadoop clusters, work with academic institutions and other large corporations on advanced cloud computing research and our engineers are active participants in the Hadoop community." - Mike Chelen from Bookmarklet
jeff hammerbacher
Fwd: How Hadoop Revolutionized Data Warehousing at Yahoo and Facebook - http://www.slideshare.net/awadall... (via http://friendfeed.com/amund...)
Alexis Lê-Quôc
New Tech Books
Hadoop - The Definitive Guide #hadoop http://www.amazon.com/dp...
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for analyzing large datasets, and administrators will learn how to set up and run Hadoop clusters. Complete with case studies that illustrate how Hadoop solves specific problems. - New Tech Books
Handed out at Hadoop Summit. Thorough and articulate. A worthwhile read. - Alexis Lê-Quôc
Deepak Singh
Notes from the 2009 Hadoop Summit West « Scale or die - http://scaleordie.com/2009...
Sidharth Shah
Mochi is a visual, log-analysis based debugging tool that correlates Hadoop's behavior in space, time and volume, and extracts a causal, unified control- and data-flow model of Hadoop across the nodes of a cluster. - Sidharth Shah
Yingfeng Zhang
Has anyone compared the performance of MPI and MapReduce (such as Hadoop) on scientific computing tasks such as clustering?
IMO best used for different kinds of parallelism. Hadoop is very data parallel and MPI works better (in my limited experience) for task parallel jobs. Also MPI scales terribly. I'd like to see it being replaced by better frameworks for task parallelism - Deepak Singh
Sidharth Shah
Hadoop should target C++/LLVM, not Java (because of watts). http://www.trendcaller.com/2009...
Interesting question, but again no data to support it. - Sidharth Shah
Mike Chelen
HadoopStreaming - Hadoop Wiki - http://wiki.apache.org/hadoop...
utility which allows users to create and run jobs with any executables (e.g. shell utilities) as the mapper and/or the reducer - Mike Chelen from Bookmarklet
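A minimal sketch of what that can look like in practice, assuming the usual streaming convention: the mapper and reducer read lines on stdin and write tab-separated key/value lines on stdout, and the reducer receives its input sorted by key. The word-count task, the function names, and the single-script layout with a `map`/`reduce` argument are illustrative choices, not something taken from the wiki page.

```python
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one "word<TAB>1" record for every word seen."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer: sum the counts for each word.

    Assumes the input is already sorted by key, so that all records
    for one word arrive next to each other.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


# Hypothetical command-line entry point: the role ("map" or "reduce")
# must be passed explicitly as the first argument.
if __name__ == "__main__" and len(sys.argv) > 1:
    step = map_lines if sys.argv[1] == "map" else reduce_lines
    for record in step(sys.stdin):
        print(record)
```

Saved as, say, `wc.py` (a hypothetical name), the same script can be checked locally without a cluster: `cat input.txt | python wc.py map | sort | python wc.py reduce`.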
Sidharth Shah
Improving MapReduce Performance in Heterogeneous Environments - http://www.usenix.org/events...
Sidharth Shah
VideoMosaic is a set of Java classes capable of generating a series of photomosaics from frames of a video. The project was implemented as my final project for Distributed Systems at the University of Washington. The Java implementation is built for rendering on a Hadoop cluster. - Sidharth Shah
Sidharth Shah
Graceful shutdown, Hadoop, and black magic - http://blog.rapleaf.com/dev...
Sidharth Shah
Redpoll is a distributed machine learning library written in Java. It runs on Hadoop, which is an open source implementation of Google's MapReduce computing model. We intend to parallelize some traditional classification and clustering algorithms like Naive Bayes, K-Means, and EM so that they can deal with large-scale data sets. It's Apache 2.0 licensed. - Sidharth Shah
Elias Torres
[#HADOOP-4012] Providing splitting support for bzip2 compressed files - ASF JIRA - https://issues.apache.org/jira...
Will Cloudera do this? ;-) - Elias Torres from Bookmarklet
Hmm, we should discuss... - jeff hammerbacher
Sidharth Shah
This isn't exactly about Hadoop, but nice interview with creators of MapReduce. - http://research.google.com/roundta...
Shawn Hsiao
Insightful summary of the current state of hadoop - Shawn Hsiao
Shawn Hsiao
Implementing core scheduler functionality in Resource Manager (V1) for Hadoop - https://issues.apache.org/jira...
Still reading up on this topic, but without a good scheduler, cluster throughput depends solely on the operators of the cluster. - Shawn Hsiao
True story, Shawn. If you're interested in helping solve this problem, drop me a line. - jeff hammerbacher
Sidharth Shah
A Map Reduce Framework for Programming Graphics Processors - http://web.mit.edu/rabbah...
Can anyone point out some applications of the Hadoop framework (MapReduce programming model + HDFS storage) in computational finance? What kinds of algorithms in computational finance are suitable for the MapReduce programming model (I know MPI does most of the existing work)? - platformgeek
platformgeek: what problems in computational finance are you trying to solve? - Amund Tveit
platformgeek: drop me a line (jeff.hammerbacher@gmail.com) and i'll let you know a few. i'm curious to know what is sparking your interest in using hadoop for computational finance. - jeff hammerbacher