Sayan Mukherjee - US grants
Affiliations: | Duke University, Durham, NC |
Area:
Computational biology , statistical learning , Laplacian statisticsWebsite:
http://www.stat.duke.edu/~sayan/We are testing a new system for linking grants to scientists.
The funding information displayed below comes from the NIH Research Portfolio Online Reporting Tools and the NSF Award Database.The grant data on this page is limited to grants awarded in the United States and is thus partial. It can nonetheless be used to understand how funding patterns influence mentorship networks and vice-versa, which has deep implications on how research is done.
You can help! If you notice any innacuracies, please sign in and mark grants as correct or incorrect matches.
High-probability grants
According to our matching algorithm, Sayan Mukherjee is the likely recipient of the following grants.Years | Recipients | Code | Title / Keywords | Matching score |
---|---|---|---|---|
2007 — 2010 | Mukherjee, Sayan | N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Collaborative Research: Probabilistic Models and Geometry For High Dimensional Data @ Duke University The inference problems associated with high-dimensional genomic data offer fundamental challenges for modern statistics, machine learning, and data-mining research. Methods that have had success in this domain impose constraints on models incorporating notions of simplicity, smoothness, or robustness. The constraints are often formalized either as priors for Bayesian methods or as geometric criteria for machine learning methods. The heart of this proposal is to develop and relate the importance of the geometry underlying the data to probabilistic modeling. The specific research foci of the proposal are: 1) The exploitation of geometric assumptions for problems of model uncertainty and variable selection in high-dimensional models; 2) A Bayesian framework for the use of ancillary or unlabeled data in predictive modeling; 3) Theory, methods and computation for nonparametric Bayesian kernel models; 4) Novel methods for nonlinear dimension reduction for high-dimensional data from regularization and geometric perspectives. |
1 |
2010 — 2012 | Mukherjee, Sayan | N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
@ Duke University Data analysis is a fundamental problem in computational science, ubiquitous in a broad range of application fields, from computer graphics to geographics information system, from sensor networks to social networks, and from economics to biological science. Two complementary fields that have driven modern data analysis are computational geometry and statistical learning. The former focuses on detailed and precise models characterizing low-dimensional geometric phenomena. The latter focuses on robust or predictive inference of models given noisy high-dimensional data. This project aims to initiate a dialog between these two fields with geometry being the central theme. A closer interaction between them will benefit and advance both fields, and can potentially fundamentally change the way we view and perform data analysis. |
1 |
2011 — 2017 | Maggioni, Mauro (co-PI) [⬀] Bendich, Paul (co-PI) [⬀] Schmidler, Scott Harer, John [⬀] Mukherjee, Sayan Daubechies, Ingrid (co-PI) [⬀] |
N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Emsw21-Rtg: Geometric, Topological and Statistical Methods For Analyzing Massive Datasets @ Duke University In the past decade, the analysis of massive, high-dimensional, time-varying data sets has become a critical issue for a large number of scientists and engineers. Observations across several disciplines, by researchers studying dramatically different problems, suggest the existence of geometrical and topological structures in many data sets, and much current research is devoted to modeling and exploiting these structures to aid in prediction and information extraction. Recent work by the investigators, among others, has shown that integrating statistical methodologies with ideas derived from computational topology and diffusion geometry often leads to strikingly superior results than by conventional means. The investigators now propose to bring these methods into the mathematics/statistics curriculum and departmental structure in a formal way, by establishing a vertically integrated program of undergraduate and graduate research and education. This activity has broad support from programs within the Division of Mathematical Sciences, including Applied Mathematics, Computational Mathematics, Statistics and Topology programs, as well as Division of Mathematical Sciences Workforce Program. |
1 |
2012 — 2015 | Mukherjee, Sayan | N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Collaborative Research: Numerical Algebra and Statistical Inference @ Duke University The investigators have two aims in this proposal that fall at the interface of numerical algebra and statistical inference. The first aim is to extend the use of randomized approximation in a variety of dimension reduction methods that rely on numerical linear algebra both supervised and unsupervised as well as linear and nonlinear and develop a statistical bases for these methods in addition to the computational motivation of being applicable to massive data. The other motivation is to extend these statistical methods for dimension reduction to multiway data using numerical multilinear algebra, a recent new development in numerical analysis. These projects will increase interaction between statistical inference and numerical analysis and benefit both fields, providing new perspectives to how we view and perform data analysis. |
1 |
2013 — 2017 | Yang, Jun Babu, Shivnath (co-PI) [⬀] Ward, Michael Mukherjee, Sayan |
N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Iii: Small: Cumulon: Easy and Efficient Statistical Big-Data Analysis in the Cloud @ Duke University "Big data" have been growing in volume and diversity at an explosive rate, bringing enormous potential for transforming science and society. Driven by the desire to convert data into insights, analysis has become increasingly statistical, and there are more people than ever interested in analyzing big data. The rise of cloud computing in recent years offers a promising possibility for supporting big data analytics. However, it remains frustratingly difficult for many scientists and statisticians to use the cloud for any non-trivial statistical analysis of big data. |
1 |
2014 — 2017 | Mukherjee, Sayan | N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
@ Duke University Critical to advancing data-enabled science is our ability to probe, conceptualize, interpret, and visualize information residing in complex datasets in order to transform data into knowledge. This project develops computational methods and tools for investigation and visualization of structural variation in networks and 3D shapes, as well as inference of functional outcomes of such variation. These problems arise in areas of strategic interest such as health and medicine, where it is important to understand patterns of morphological variation in biological shapes and structural changes in biological networks, and their roles in behavior, health and disease. To emphasize this important interdisciplinary facet, the project is supported by several case studies that investigate: (i) mechanisms underlying the development of facial shape; (ii) interactions between brain shape, skull morphology, and behavior; (iii) organization of microbial communities in the digestive tract; and (iv) connections between social networks and microbiome networks. We envision many long-term ramifications of the project. Potential applications include analyses of dynamical social networks, exploratory discovery of associations between biological networks and phenotypic traits or diseases, quantitative studies of evolution, development, and inheritance of morphological traits, and challenges such as indexing and organizing databases of networks or 3D shapes for efficient data management, search, and retrieval. |
1 |
2015 — 2018 | Mukherjee, Sayan | N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
@ Duke University This research will leverage ideas from algebraic and differential geometry to address core problems in modern high-dimensional and massive data science. The project will develop statistical methods and numerical tools, grounded in solid mathematical, statistical, and computational foundations, to extract low dimensional geometry from massive data with applications in clustering, data summarization, prediction, dimension reduction, and visualization. The solutions developed as part of this project can result in fundamental advances in practical applications across fields as diverse as biology, medicine, social sciences, communication networks, and engineering. In addition to internal validation via statistical and mathematical theory and simulation studies, the methods developed in the project will involve external validation via interdisciplinary applications. These applications include: (1) inference of population structure from genomic data; (2) document analysis via topic models; and (3) inference of subsets of putative gene networks relevant to drug resistance in melanoma. |
1 |
2016 — 2019 | Nobel, Andrew (co-PI) [⬀] Mukherjee, Sayan Mcgoff, Kevin [⬀] |
N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Random Dynamical Systems and Limit Theorems For Optimal Tracking @ University of North Carolina At Charlotte Dynamical systems serve as important mathematical models for a wide variety of physical phenomena, arising in such areas as weather modeling, systems biology, and statistical physics. A dynamical system consists of a state space, in which a point represents a complete description of the state of the system, and a rule governing the evolution of the system from one state to another. This project focuses on the long-term behavior of such systems from two complementary points of view. From the first point of view, the project seeks to describe the behavior of typical systems when the rules of evolution are chosen at random. Such results shed light on what properties one might expect to find in disordered systems. The second point of view, the "inverse problem," concerns the statistical problem of recovering some information from the observation of a dynamical system. While there are many examples of dynamical systems being used as mathematical models, and there is a large statistical literature regarding inference and estimation, the performance of statistical procedures when applied to data generated by nonlinear dynamical systems is poorly understood. This project focuses on characterizing when traditional statistical procedures may be effectively applied in the context of dynamical systems. Beyond the very fertile potential applications, the project will also have broader impact on training of graduate students who will acquire invaluable skills in sound probabilistic modeling and statistical inference by working on the project's research topics. |
0.97 |
2017 — 2020 | Gunnell, Gregg (co-PI) [⬀] Mukherjee, Sayan Boyer, Douglas Mcgeary, Timothy |
N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
@ Duke University People and societies thrive best when they understand how the social and physical dynamics of their environment work, allowing them to respond appropriately. Natural scientists have built our understanding of the physical world. The scientific understanding they built has contributed to the development of technologies and practices that benefit human economies. For example, genetic sequencing of DNA enables deeper understanding of biological organisms; the consequences for human health, food production, understanding of evolutionary adaptation, etc. have been revolutionary and are still unfolding. The DNA sequence is the blueprint for an organism's anatomical structure (morphology) and function, but images capturing morphology are now much less prevalent than genetic data. Museums and researchers have been creating 3D digital images of natural history collections, and there are extensive 3D image data sets for some model organisms, but these data are mostly in closed collections, and generally unavailable or very difficult to access. This project aims to provide infrastructure to increase the accessibility of anatomical information, with a focus on 3D images. The resource will create the first open access, web-enabled image archive accepting and serving high-resolution, 3D scans of all organisms, called MorphoSource. Standardized descriptive tags will allow scientists to use this database to easily combine genetic and anatomical datasets for the first time, supporting the formulation of novel research questions. MorphoSource will link to other databases (such as iDigBio [www.idigbio.org]) that aggregate information on museum specimens from around the world. Having a shared common resource will change the culture among researchers and museums, making collaborations between physically distant experts more feasible, but it will also open the linked research collections of museums to anyone with Internet access anywhere in the world. Large data sets are prerequisites for many statistical and machine learning methods, so the resource will enable innovations in computational image analysis methods, fostering new types of collaborations that advance field-wide scientific understanding. The resource will track data use, enhancing reproducibility and also providing an objective metric of the value of individual data elements. Open access to the data linked through MorphoSource will enable anyone with Internet access to see the detailed anatomical evidence for theories like evolution. Pilot work has shown that teachers and students eagerly consume this newly available information, with numbers already in the thousands. Positive results of this access include (1) providing a more intuitive type of raw data (compared to DNA sequence) for showing the public why some conclusions about evolutionary relationships were reached, (2) providing an 'interest metric' for the value of natural history museums and the collections they hold, (3) increasing the community of people (including citizen scientists) who have access to the data required to make important discoveries by studying biological variation. |
1 |
2017 — 2020 | Mukherjee, Sayan | N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Beyond Riemannian Geometry in Inference @ Duke University A challenge in modern data science is how to integrate information from 3-dimensional shapes into statistical models. Examples of applications where this challenge is central is associating the shape of a tumor to molecular processes or associating the shape of the roots of a rice plant to crop yield. In graphics and anatomy, there is the related question of how to warp one object into another, such as the molar of a child to a molar of an adult. This project seeks to develop methodology to transform these shape data, such as meshes or 3-dimensional images, into representations for which standard statistical models are available. These methods are crucial to advancing data-enabled science in extracting, conceptualizing, interpreting, and visualizing information residing in datasets comprising complex objects such 3D shapes. Cutting edge ideas in mathematics, specifically from the field of geometry, will be used to address the fundamental problems of (i) modeling structural variation in diverse collections of shapes and (ii) modeling transformations between shapes. Solutions to these problems are crucial to many practical applications and disciplines including biology, medicine, social sciences, and ecology. As such, an important component of the project is validation of methods and tools through applications in radiology and anthropology |
1 |
2018 — 2021 | Gilbert, Jack (co-PI) [⬀] Archie, Elizabeth Mukherjee, Sayan |
N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Rol: Fels: Raise: Does Everyone's Microbiome Follow the Same Rules? @ University of Notre Dame Gut microbiomes are diverse microbial communities that play important roles in mammal physiology and health. However, scientists do not yet understand the rules that govern gut microbiome changes (such as how gut microbiomes change over time), or whether these rules are the same or different in different individuals. The goal of this project is to discover if gut microbiome dynamics are "universal" - that is, the same across hosts - or if these microbial changes are host-specific, such that each host's microbiome follows its own rules. Discovering whether each host's microbiome follows its own rules is relevant to human and animal health. If microbiome dynamics are universal, scientists can use these rules to design microbiome therapies that work the same way in different individuals. On the other hand, if these microbes are unique to each individual, then interventions to improve human or animal health must be designed separately for each individual. The work has benefits to broadening participation in STEM education through training in the investigators' labs and an educational outreach project with a public high school in Indiana. In addition, the researchers will make all statistical methods and code available online for use by other microbiome researchers and microbial ecologists. |
0.97 |
2019 — 2022 | Mukherjee, Sayan Rudin, Cynthia (co-PI) [⬀] Calderbank, Arthur Lu, Jianfeng (co-PI) [⬀] Ge, Rong (co-PI) [⬀] |
N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
@ Duke University This award supports TRIPODS@Duke Phase I, a project that will develop the foundations of data science both at Duke University and in the broader NC Research Triangle and surrounding region. A total of 25 faculty at Duke representing the disciplines of Computer Science, Electrical Engineering, Mathematics, and Statistical Science will be involved in Phase I. Activities include five semesters of workshops, with 3-4 one-week workshops each semester. These workshops will involve local and national participants and will bring experts on data science to the area. The project will support graduate students and postdoctoral trainees both in terms of education in the foundations of data science as well as in their professional development. Educational activities include the development and teaching of data science across curricula in Computer Science, Electrical and Computer Engineering, Mathematics, and Statistical Science, both at the undergraduate and graduate levels. The project will also leverage existing data science programs, including the Rhodes Information Initiative at Duke, a center for "big data" computational research and expanding opportunities for student engagement in data science; and the Statistical and Applied Mathematical Sciences Institute (SAMSI), one of the NSF/DMS-funded Mathematical Sciences Research Institutes (MSRIs), which is a partnership among Duke University, North Carolina State University (NCSU), and the University of North Carolina at Chapel Hill (UNC). |
1 |
2021 — 2024 | Nobel, Andrew [⬀] Mukherjee, Sayan Mcgoff, Kevin (co-PI) [⬀] |
N/AActivity Code Description: No activity code was retrieved: click on the grant title for more information |
Inference For Stationary Processes: Optimal Transport and Generalized Bayesian Approaches @ University of North Carolina At Chapel Hill This project will address the problem of making inferences about sequences of observations that exhibit dependence arising from physical or other interactions. Observations of this sort occur in many fields, including finance, ecology, natural language processing, and biology. We will explore ways to fit a sequence of observations to a family of statistical models using ideas from the theory of optimal transport. Informally, we will identify models in the family into which the generating mechanism of the observations can be transformed with the least overall cost. We will address both the theory and efficient computation of these transformation costs, and will consider applications to biomedicine and computer science. The project will involve collaborations with graduate students and more senior researchers working in genomics and bioinformatics. Both undergraduate and graduate students will receive training through involvement in supported research projects. |
0.97 |