Modular toolkit for Data Processing (MDP) - http://mdp-toolkit.sourceforge.net/
Modular toolkit for Data Processing (MDP) is a Python data processing framework. - anand
map reduce via Bash - anand
Trivers-Willard hypothesis - Wikipedia, the free encyclopedia - http://en.wikipedia.org/wiki...
if a heritable attribute is more beneficial to children of one sex, then parents will bear more offspring of that sex - anand
ALPAC: the (in)famous report - http://www.hutchinsweb.me.uk/ALPAC-1...
ALPAC: the (in)famous report - anand
ALPAC (Automatic Language Processing Advisory Committee) - http://en.wikipedia.org/wiki...
InputFormat (Hadoop 0.18.2 API) - http://hadoop.apache.org/core...
Number of map processes for input - anand
JINR (ISSN 1916-7423) is an electronic journal, with a printed version to be negotiated with a major publisher once we have established a steady presence. The journal will bring to the fore research in Natural Language Processing and Machine Learning that uncovers interesting negative results. - anand
Read It: Search User Interfaces - http://searchuserinterfaces.com/book...
Search User Interfaces by Marti Hearst - anand
Evaluating POS Taggers: The Contenders - http://workproduct.wordpress.com/2008...
Evaluating POS taggers: Speed - http://workproduct.wordpress.com/2008...
An Impossibility Theorem for Clustering - http://www.cs.cornell.edu/home...
Although the study of clustering is centered around an intuitively compelling goal, it has been very difficult to develop a unified framework for reasoning about it at a technical level, and profoundly diverse approaches to clustering abound in the research community. Here we suggest a formal perspective on the difficulty in finding such a unification, in the form of an impossibility theorem: for a set of three simple properties, we show that there is no clustering function satisfying all three. Relaxations of these properties expose some of the interesting (and unavoidable) trade-offs at work in well-studied clustering techniques such as single-linkage, sum-of-pairs, k-means, and k-median. - anand
Java Wordnet Similarity Library for JWI - anand
Retrieving collocations from text: Xtract - http://acl.ldc.upenn.edu/J...
Home - Common Tag - http://www.commontag.org/Home
Common Tag is an open tagging format developed to make content more connected, discoverable and engaging. Unlike free-text tags, Common Tags are references to unique, well-defined concepts, complete with metadata and their own URLs. With Common Tag, site owners can more easily create topic hubs, cross-promote their content, and enrich their pages with free data, images and widgets. - anand
Announcing the Yahoo! Distribution of Hadoop - http://developer.yahoo.net/blogs...
details about the nltk wordnet interface - anand
phpsyntaxtree - Google Code - http://code.google.com/p...
phpSyntaxTree is a web application that creates syntax tree graphs from phrases entered in labelled bracket notation. phpSyntaxTree generated graphs can be used in linguistic homework, assignments and other documents. - anand
Every Move You Make: Free Smart Phone App Helps Burn Calories - http://www.uh.edu/news-ev...
tricube function - LOESS - http://www.unc.edu/courses...
explanation of the tricube weight function - anand
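A minimal sketch of the tricube weight as it is commonly defined for LOESS: w(u) = (1 - |u|^3)^3 for |u| < 1, and 0 otherwise. The function names and the bandwidth parameter `h` are illustrative assumptions, not taken from the linked page.

```python
def tricube(u):
    """Tricube weight: (1 - |u|^3)^3 inside the unit interval, 0 outside."""
    a = abs(u)
    if a >= 1.0:
        return 0.0
    return (1.0 - a ** 3) ** 3


def loess_weights(xs, x0, h):
    """Weights for points xs around query x0, scaled by an assumed bandwidth h > 0."""
    return [tricube((x - x0) / h) for x in xs]
```

Points at the query location get weight 1, and the weight falls smoothly to 0 at a distance of one bandwidth.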
Streaming for large scale NLP: Language Modeling - http://hal3.name/docs...
In this paper, we explore a streaming algorithm paradigm to handle large amounts of data for NLP problems. We present an efficient low-memory method for constructing high-order approximate n-gram frequency counts. The method is based on a deterministic streaming algorithm which efficiently computes approximate frequency counts over a stream of data while employing a small memory footprint. - anand
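The abstract does not name the deterministic streaming algorithm it uses, so the sketch below swaps in the Misra-Gries frequent-items summary as a stand-in. It illustrates the general idea of approximate n-gram counting in bounded memory; the helper names and the parameter `k` are assumptions made for this example.

```python
def ngrams(tokens, n):
    """Yield consecutive n-grams (as tuples) from a list of tokens."""
    return zip(*(tokens[i:] for i in range(n)))


def misra_gries(stream, k):
    """Approximate counts using at most k - 1 counters.

    Each reported count undercounts the true count by at most len(stream) / k.
    Items that are dropped are implicitly estimated as 0.
    """
    counters = {}
    for item in stream:
        if item in counters:
            counters[item] += 1
        elif len(counters) < k - 1:
            counters[item] = 1
        else:
            # No free counter: decrement every counter and drop the zeros.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]
    return counters
```

For example, `misra_gries(ngrams(tokens, 3), k)` would summarize trigram frequencies while holding at most `k - 1` trigrams in memory at any time.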
tragedy of the commons - http://www.sciencemag.org/cgi...
Boosting is a general method for producing a very accurate classification rule by combining rough and moderately inaccurate "rules of thumb." While rooted in a theoretical framework of machine learning, boosting has been found to perform quite well empirically. This tutorial will introduce the boosting algorithm AdaBoost, and explain the underlying theory of boosting, including explanations that have been given as to why boosting often does not suffer from overfitting, as well as some of the myriad other theoretical points of view that have been taken on this algorithm. Some recent applications and extensions of boosting will also be described. - anand
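A rough sketch of how AdaBoost combines weak "rules of thumb", assuming one-dimensional inputs, labels in {-1, +1}, and threshold stumps as the weak learners. These choices and all function names are assumptions for illustration; the tutorial itself may present the algorithm differently.

```python
import math


def stump(x, threshold, polarity):
    """Weak rule: predict `polarity` when x >= threshold, else the opposite."""
    return polarity if x >= threshold else -polarity


def train_adaboost(xs, ys, rounds):
    """Return a list of (alpha, threshold, polarity) weighted stumps."""
    n = len(xs)
    weights = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        # Pick the stump with the lowest weighted training error.
        best = None
        for t in sorted(set(xs)):
            for pol in (1, -1):
                err = sum(w for w, x, y in zip(weights, xs, ys)
                          if stump(x, t, pol) != y)
                if best is None or err < best[0]:
                    best = (err, t, pol)
        err, t, pol = best
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) and division by 0
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, t, pol))
        # Raise the weight of misclassified examples, then renormalize.
        weights = [w * math.exp(-alpha * y * stump(x, t, pol))
                   for w, x, y in zip(weights, xs, ys)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble


def predict(ensemble, x):
    """Sign of the alpha-weighted vote over all stumps."""
    score = sum(a * stump(x, t, p) for a, t, p in ensemble)
    return 1 if score >= 0 else -1
```

Each round reweights the training set so the next stump concentrates on the examples the current ensemble gets wrong.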
Chapter 6. Replication in MySQL - http://www.cit.gu.edu.au/doc...
Twitter Data - A simple, open proposal for embedding data in Twitter messages - Home - http://twitterdata.org/
a simple, open proposal for embedding data in Twitter messages - anand
Taking a New Look at Health : GE - http://www.ge.com/visuali...
Taking a New Look at Health What are the major health issues facing Americans today? What are some of the most common conditions, and how are they related to one another? What can we do to improve our health? - anand
Artificial Intelligence | Natural Language Processing | Stanford School of Engineering - http://see.stanford.edu/see...
This course is designed to introduce students to the fundamental concepts and ideas in natural language processing (NLP), and to get them up to speed with current research in the area. It develops an in-depth understanding of both the algorithms available for the processing of linguistic information and the underlying computational properties of natural languages. Word-level, syntactic, and semantic processing from both a linguistic and an algorithmic perspective are considered. The focus is on modern quantitative techniques in NLP: using large corpora, statistical models for acquisition, disambiguation, and parsing. Also, it examines and constructs representative systems. - anand
Jerry Yang commencement speech - http://ycorpblog.com/files...
Video of airline travelers’ tweets » VentureBeat - http://venturebeat.com/2009...
Are these search query trends hinting at something? coincidence? maybe not - what do you think? - http://trends.google.com/trends...