Efficiently querying archived data using Hadoop
Rajeev Gupta, Himanshu Gupta, et al.
CIKM 2010
Data delivered over the Internet is increasingly being used to provide dynamic and personalized user experiences. Queries over fast-changing data from distributed data sources are executed to create content to be delivered to users. Because these queries require data from multiple sources, they're executed at intermediate proxies or data aggregators. The authors discuss various techniques for executing aggregation queries over distributed data to minimize the number of message exchanges between data sources, aggregators, and users. They carefully examine the problem in terms of different types of queries, aggregation functions, query imprecisions, and whether the aggregators get data from sources using pull- or push-based mechanisms. © 2006 IEEE.