Systems | Information | Learning | Optimization
 

Quantifying Ad Fraud – Contamination Estimation via Convex Relaxations

Identifying contamination in datasets is important in a wide variety of settings, including view and click fraud in online advertising. After a brief overview of digital ad fraud, I’ll describe a technique for estimating contamination in large, categorical datasets. The technique involves solving a series of convex programs, resulting in …

Internet Device Graphs

Digital adverting is arguably the largest and most ubiquitous application of machine learning. Learning algorithms pick the ads we see by inferring information about who we are and what we might buy. Graph datasets, due to their simplicity, play a central role in facilitating this inference. Internet Device Graphs are …

Dynamic optimization of fractionation schedules in radiation therapy and On Finding the Largest Mean Among Many

Jagdish Ramakrishnan In radiation therapy, the fractionation schedule, i.e. the total number of treatment days and the dose delivered per day, plays an important role in treatment outcome. In the first part of the talk, we analyze the effect of tumor repopulation on the optimal fractionation scheme. We find that …

Sequential Testing in High Dimensions | Relaxations for Production Planning Problems with Increasing By-products

Matt’s talk: Title: Sequential Testing in High Dimensions Sequential methods make use of information as it becomes available, creating an interactive connection between a sampling procedure and information gathered by that procedure. In this talk we explore sequential methods applied to sparse recovery problems, motivated by applications in both communications …