Crowd crawling: Towards Collaborative Data Collection for Large-scale Online Social Networks

Cite this publication

​Crowd crawling: ​Towards Collaborative Data Collection for Large-scale Online Social Networks​
Ding, C.; Chen, Y.   & Fu, X. ​ (2013)
pp. 183​-188. ​COSN​, Boston, Massachusetts, USA.
COSN. DOI: https://doi.org/10.1145/2512938.2512958 

Documents & Media

License

GRO License GRO License

Details

Authors
Ding, Cong; Chen, Yang ; Fu, Xiaoming 
Abstract
The emerging research for online social networks (OSNs) requires a huge amount of data. However, OSN sites typi- cally enforce restrictions for data crawling, such as reques t rate limiting on a per-IP basis. It becomes challenging for an individual research group to collect sufficient data by using its own network resources. In this paper, we intro- duce and motivate crowd crawling , which allows multiple re- search groups to efficiently crawl data in a collaborative way . Crowd crawling is carefully designed by addressing several practical challenges including resource diversity of diffe rent partners, strict request rate limiting from OSN providers, and data fidelity. We implemented and deployed a crowd crawling prototype on PlanetLab, and demonstrated its per- formance through evaluations. We have made the datasets crawled in our evaluation publicly available.
Issue Date
2013
Publisher
COSN
Conference
COSN
ISBN
978-1-4503-2084-9
Conference Place
Boston, Massachusetts, USA
Event start
2013-10-07
Event end
2013-10-08
Language
English

Reference

Citations


Social Media