Big Data

Big Data, Cloud Computing, Parallel Computing, Data Mining & Analytics
Big Data
Hadoop World: Security and API Compatibility - http://www.cloudera.com/blog...
Big Data
A framework for thinking about data warehouse growth - http://www.dbms2.com/2009...
Mike Chelen
"Datahub is a tool that allows faster download/crawl, parse, load, and visualization of data. It achieves this by allowing you to divide each step into its own work folder. In each work folder you get sample files that you can start coding in." - Mike Chelen from Bookmarklet
Big Data
Google Atmosphere Conference: Youtube videos of talks http://www.youtube.com/view_pl... #cloudcomputing #bigdata
Big Data
Webinar on MapReduce for complex analytics (Thursday, December 3, 10 am and 2 pm Eastern) - http://www.dbms2.com/2009...
Eric Borisch
"This example shows how OpenCL can be used to compute a Fast Fourier Transform. The techniques and algorithms used in this sample are described in Fitting FFT onto the G80 Architecture (Volkov and Kazian) and High Performance Discrete Fourier Transforms on Graphics Processors (Govindaraju, Lloyd, et al)." - Eric Borisch from Bookmarklet
Big Data
Hadoop World: Hadoop for Bioinformatics - http://www.cloudera.com/blog...
Big Data
W/ data portability an issue in the Cloud http://www.informationweek.com/news..., Locust Storage & similar startups have an opening http://www.locust-storage.com/
Mike Chelen
Fwd: Reading about Lucandra (Lucene on Cassandra) by @tjake http://github.com/tjake... (via http://friendfeed.com/dacort...)
Big Data
Low-cost alternative to SAS: WPS SAS code interpreter. Too bad it's not free & open-source http://www.teamwpc.co.uk/product... (via http://www.information-management.com/blogs...)
Big Data
Hadoop World: Practical HBase from Jonathan Gray and Ryan Rawson - http://www.cloudera.com/blog...
Big Data
New England Database Summit (January 28, 2010) - http://www.dbms2.com/2009...
Big Data
SaaS BI/reporting now available (e.g. www.gooddata.com www.pivotlink.com). We also need SaaS stats+machine-learning (i.e. SaaS SAS)
Big Data
Hadoop World: Hadoop + Vertica from Omer Trajman - http://www.cloudera.com/blog...
Big Data
Comments on a fabricated press release quote - http://www.dbms2.com/2009...
Big Data
Boston Big Data Summit keynote outline - http://www.dbms2.com/2009...
Big Data
Hadoop World: Hadoop + Clojure from Stuart Sierra and Tim Dysinger - http://www.cloudera.com/blog...
Big Data
ACM Symposium on Cloud Computing (SOCC) - http://databeta.wordpress.com/2009...
Big Data
Hadoop World: Protein Alignment from Paul Brown - http://www.cloudera.com/blog...
Big Data
Hadoop at Twitter (part 1): Splittable LZO Compression - http://www.cloudera.com/blog...
Big Data
SAS partners w/ leading MPP databases to make analytic tools available in-database (no Oracle, Vertica, ParAccel) http://www.sas.com/news... #sas
Mike Chelen
Who is Ulam and what was his dilemma? - http://answers.yahoo.com/questio...
"His mind was too fast to write things down: if he stopped to write down a formula, he would miss his next thought, but if he did not write it down, he might forget what he had been thinking. If he had too many thoughts in his head, there would not be room for the new ones." - Mike Chelen from Bookmarklet
an apt analogy for data analysis, because there always exists far more data than can be recorded - Mike Chelen
Big Data
Source code for Hadoop Online Prototype http://code.google.com/p/hop/ (see Radar post on HOP) http://radar.oreilly.com/2009... #realtime #bigdata
Big Data
2 nascent Big Data resources: O'Reilly Answers http://answers.oreilly.com/tag..., & a Google Grp that I want to get off the ground http://groups.google.com/group...
Big Data
Counting unique users in real-time with streaming databases http://radar.oreilly.com/2009... #realtime #cep #streams
Big Data
Hadoop World: Rethinking the Data Warehouse with Hadoop and Hive from Ashish Thusoo - http://www.cloudera.com/blog...
Big Data
Hadoop World: Monitoring Best Practices from Ed Capriolo - http://www.cloudera.com/blog...
Big Data
Pathologies of Big Data: Interesting that R is "singled out", but there's an easy R workaround: use a random sample http://cacm.acm.org/magazin... #rstat
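A minimal sketch of the random-sample idea in the post above, written in Python rather than R. It assumes the data arrives as a stream too large to hold in memory and uses reservoir sampling (Algorithm R) to keep a uniform sample of fixed size; the function name `reservoir_sample` and its parameters are hypothetical, not taken from the linked article.

```python
import random


def reservoir_sample(rows, k, seed=None):
    """Return a uniform random sample of up to k items from an iterable.

    Hypothetical helper: only k items are held in memory at once,
    so the input can be larger than RAM.
    """
    rng = random.Random(seed)
    sample = []
    for i, row in enumerate(rows):
        if i < k:
            # Fill the reservoir with the first k items.
            sample.append(row)
        else:
            # Replace a reservoir slot with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                sample[j] = row
    return sample


if __name__ == "__main__":
    # Draw 5 values from a stream of one million integers.
    print(reservoir_sample(range(1_000_000), 5, seed=42))
```

The sample can then be loaded into an in-memory tool for exploratory analysis, at the cost of sampling error in any statistic computed from it.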