计算传播学的日记

Socrates 2015-10-05 15:18:33
http://www.csdn.net/article/2014-06-06/2820111-100-Interesting-Data-Sets-for-Statistics/1

Socrates 2015-10-05 15:03:59
转载自:http://rensanning.iteye.com/blog/1601663 海量数据数据集 海量数据(又称大数据)已经成为各大互联网企业面临的最大问题,如何处理海量数据,提供更好的解决方案,是目前相当热门的一个话题。类似MapReduce、 Hadoop等架构的普遍推广,大家都在构建自己的大数据处理,大数据分析平台。 相应之下,目前对于海......

Socrates 2015-08-13 11:45:31
数据:http://wiki.dbpedia.org/news/dbpedia-version-2014-released Computational Fact Checking from Knowledge Networks http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0128193

Socrates 2015-05-14 10:05:49
https://kddcup2015.com/information-introduction.html Background: Students' high dropout rate on MOOC platforms has been heavily criticized, and predicting their likelihood of dropout would be useful for maintaining and encouraging students' learning activities. Therefore, in KDD Cup 2015, we will......

Socrates 2015-05-11 11:49:58
http://www.dtic.upf.edu/~ocelma/MusicRecommendationDataset/lastfm-360K.html Music Recommendation Datasets for Research L a s t . f m D a t a s e t - 3 6 0 K u s e r s http://www.benfrederickson.com/distance-metrics/

Socrates 2015-04-30 21:24:05
http://data.gdeltproject.org/events/index.html Supported by Google Ideas, the GDELT Project monitors the world's broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, counts, themes, sources, emotions, cou......

Socrates 2015-04-28 20:58:37
http://labs.criteo.com/downloads/2014-kaggle-display-advertising-challenge-dataset/ File descriptions train.csv - The training set consists of a portion of Criteo's traffic over a period of 7 days. Each row corresponds to a display ad served by Criteo. Positive (clicked) and negatives (non-clicke......

Socrates 2015-04-28 20:57:17
China Biographical Database Project (CBDB) http://isites.harvard.edu/icb/icb.do?keyword=k16229 Database Projects 费正清中国研究中心开放的三个数据 1. China Map Sponsored by the Lee and Juliet Folger Fund with support from the Fairbank Center for Chinese Studies and the Weatherhead Center for Inte......

Socrates 2015-04-21 09:37:38
http://u.cs.biu.ac.il/~koppel/BlogCorpus.htm The Blog Authorship Corpus consists of the collected posts of 19,320 bloggers gathered from blogger.com in August 2004. The corpus incorporates a total of 681,288 posts and over 140 million words - or approximately 35 posts and 7250 words per person. ......

Socrates 2014-02-25 12:52:06
http://webscope.sandbox.yahoo.
http://webscope.sandbox.yahoo.com/catalog.php We have various types of data available to share. They are categorized into Ratings, Language, Graph, Advertising and Market Data, Computing Systems and an appendix of other relevant data and resources available via the Yahoo! Developer Network. Lang......

<前页 1 2 后页>