less than 1 minute read

Header

Robust Distributed Learning of Functional Data From Simulators through Data Sketching

  • Time: Monday 1/29/2024 from 11:40 AM to 12:30 PM
  • Location: BLOC 448
  • Pizza and drinks provided

Description

Emerging distributed Bayesian learning frameworks partition raw data into shards and distribute computations of these shards across machines. While this strategy mitigates data storage costs and improves computation within each machine, concerns arise regarding the sensitivity of distributed inference to shard selection. Our research proposes an innovative alternative. Instead of creating data shards, we employ multiple random matrices to construct multiple random linear projections, or “random sketches,” of the complete dataset. Posterior inference on functional data models is performed using random sketches on various machines in parallel. These individual inferences are then combined across machines at a central server. By aggregating inference across diverse random matrices, our approach proves resilient to the selection of data sketches, leading to the development of novel robust distributed Bayesian learning approach. An important advantage of our approach is its ability to maintain the privacy of sampling units, as the inference is based on random data sketches that do not allow the recovery of raw data.

Presentation

Recording

IMG_0133.jpg IMG_0128.jpg IMG_0124.jpg IMG_0117.jpg

Categories:

Updated: