Select Lab Publications


Distributed Regression: an Efficient Framework for Modeling Sensor Network Data (2004)

By: Carlos Guestrin, Peter Bodik, Romain Thibaux, Mark Paskin, and Samuel Madden

Abstract: We present distributed regression, an efficient and general framework for in-network modeling of sensor data. In this framework, the nodes of the sensor network collaborate to optimally fit a global function to each of their local measurements. The algorithm is based upon kernel linear regression, where the model takes the form of a weighted sum of local basis functions; this provides an expressive yet tractable class of models for sensor network data. Rather than transmitting data to one another or outside the network, nodes communicate constraints on the model parameters, drastically reducing the communication required. After the algorithm is run, each node can answer queries for its local region, or the nodes can efficiently transmit the parameters of the model to a user outside the network. We present an evaluation of the algorithm based upon data from a 48-node sensor network deployment at the Intel Research - Berkeley Lab, demonstrating that our distributed algorithm converges to the optimal solution at a fast rate and is very ro bust to packet losses.



Download Information
Carlos Guestrin, Peter Bodik, Romain Thibaux, Mark Paskin, and Samuel Madden (2004). "Distributed Regression: an Efficient Framework for Modeling Sensor Network Data." Information Processing in Sensor Networks (IPSN). pdf   talk        
BibTeX citation

@inproceedings{Guestrin+al:ipsn2004distregress,
author = {Carlos Guestrin and Peter Bodik and Romain Thibaux and Mark Paskin and Samuel Madden},
title = {Distributed Regression: an Efficient Framework for Modeling Sensor Network Data},
booktitle = {Information Processing in Sensor Networks (IPSN)},
year = {2004},
address = {Berkeley},
month = {April},
wwwfilebase = {ipsn2004-guestrin-bodik-thibaux-paskin-madden},
wwwtopic = {Sensor Networks}
}



full list