Международные научные проекты (original) (raw)

В 2021-22 учебном году у студентов ФКН есть возможность выбрать иностранного научного руководителя курсовой или выпускной квалификационной работы. Список руководителей и предлагаемых ими тем представлен ниже.

Исследователь Аффилиация Название проекта на английском языке Название проекта на русском языке Требования Длительность Описание
Tommaso Dorigo National Institute of Nuclear Physics (Padova section) End-to-end optimization of the design of a muon tomography apparatus Сквозная оптимизация аппарата для мюонной томографии Python programming, notions of machine learning. Some prior knowledge of PyTorch is useful. Time involvement: from three to six months. By using differentiable programming libraries of PyTorch we are developing a pipeline for the optimization of the performance of a muon tomography detector in estimating the atomic number of an unknown volume of material, by choosing a layout, geometry, and cost of detection elements. The student would help code some of the modules of the pipeline and extend its functionality. Possibility to co-author a publication of the proof-of-principle software.
Tommaso Dorigo National Institute of Nuclear Physics (Padova section) End-to-end optimization of the design of a hybrid calorimeter Сквозная оптимизация гибридного калориметра Python programming, notions of machine learning. Some prior knowledge of PyTorch or TensorFlow is useful. Time involvement: six months or more. Differentiable programming may enable the investigation of design choices for a calorimeter for a particle physics experiment. In this open-ended study, the student will consider possible layouts of a device comprised of tracking as well as scintillation layers, with the purpose of understanding the existence of novel design solutions to the problem of inferring the energy and direction of particle showers. Possibility to co-author a publication on the studies performed.
Jie-Hong Roland Jiang National Taiwan University Language and string constraint solving Boolean circuit learning Quantified decision procedures Решение языковых и строковых ограничений Обучение булевым схемам Количественные процедуры принятия решений
Davide Bresolin University of Padova Automata theory, formal methods, model checking, cyberphysical-systems (precise topics upon inquiry) Теория автоматов, формальные методы, проверка моделей, киберфизические системы (точные темы по запросу) Up to two master students On-site supervision
Paolo Baldan University of Padova Semantic foundations and formal methods for software systems Семантические основы и формальные методы для программных систем Up to two master students On-site supervision
Mauro Conti University of Padova Cybersecurity Кибербезопасность Up to three master students Both remote and on-site supervision
Massimiliano de Leoni University of Padova (Multi-perspective) Process Mining and Conformance Checking (Многоперспективный) поиск процессов и проверка соответствия Up to two master students Preferably on-site supervision; if not, remotely, or dual
Francesco Rinaldi University of Padova Optimization Methods for Data Science and Complex Systems Методы оптимизации для науки о данных и сложных систем Up to four master students Both remote and on-site supervision
Paolo Guiotto University of Padova Optimal Decisions in Integrated Operations Management Оптимальные решения в интегрированном управлении операциями Up to two master students Both remote and on-site supervision
Maria Emilia Maietti University of Padova Categorical logic and type theory Категориальная логика и теория типов Up to three master students Both remote and on-site supervision
Claudio Marchi University of Padova Mean Field Games systems: first order or second order case Partial Differential Equation on networks Системы игр со средним полем: случай первого или второго порядка Частные дифференциальные уравнения в сетях One bachelor or master student Both remote and on-site supervision
Fabio Paronetto University of Padova Harnack's inequality for elliptic and parabolic equations Gamma-convergence and/or G-convergence Неравенство Харнака для эллиптических и параболических уравнений Гамма-конвергенция и/или G-конвергенция One bachelor and one master student Both remote and on-site supervision
Paolo Rossi University of Padova Integrable systems and moduli space of curves Интегрируемые системы и модульное пространство кривых Up to two master students On-site supervision The moduli space of curves classifies all possible complex smooth algebraic curves (Riemann surfaces) and can be compactified to produce a closed compact space (a complex orbifold, to be precise). Each point represents a Riemann surface, up to reparametrization. Given the centrality of Riemann surface geometry in several fields of mathematics, from geometry to mathematical physics, the study of the topology of their moduli space is an important classical problem. Since the 90s and Witten's conjecture (a surprising intuition coming from string theory, later proved by Kontsevich) it has been known that integrable systems of Hamiltonian PDEs (an infinite-dimensional application of Arnold-Liouville integrability) control intersection theory of the moduli space of curves. This interaction has been fruitful both for geometry and mathematical physics. There are several special and very modern topics that use beautiful mathematics and could form the heart of a research-oriented master thesis. The details can be discussed and decided with the interested student.
Angelos Anadiotis Ioana Manolescu Ecole Polytechnique, Institut Polytechnique de Paris Distributed Big Data & ML architecture for ConnectionLens Распределенная архитектура больших данных и машинного обучения для ConnectionLens One master student or a group of two students • Relational database management systems • Graph data management • Distributed systems • Machine learning techniques for natural language processing Both remote and on-site supervision ConnectionLens is a system that finds connections between user-specified search terms across heterogeneous data sources. ConnectionLens treats a set of heterogeneous, independently authored data sources as a single virtual graph, where nodes represent fine-grained data items (relational tuples, attributes, key-value pairs, RDF, JSON or XML nodes…) and edges correspond either to structural connections (e.g., a tuple is in a database, an attribute is in a tuple, a JSON node has a parent) or to similarity (sameAs) links. Currently, the pipeline is centralized, that is, it is deployed on a single, scale-up server. The goal of the thesis is to scale the end-to-end graph construction pipeline, to leverage a distributed computational infrastructure (i.e., cluster). Achieving this goal requires two main tasks: • Design and implement a distributed version of the graph construction algorithm, including database and Machine Learning (Information Extraction) computations • Store the graph in a distributed store, such as Impala, Cassandra, HBase, while also considering pure graph stores such as MongoDB and Neo4j. Full description and references (PDF, 79 Кб)
Jesse Read LIX Laboratory, Ecole Polytechnique, Institute Polytechnique de Paris Bayesian-Neural Methods for Missing Data Imputation with Applications in Bioinformatics Байесовско-нейронные методы для введения отсутствующих данных с приложениями в биоинформатике The project can be adapted to either bachelor or master level. Knowledge of and experience in machine learning, including at least one deep-learning framework (e.g., TensorFlow or PyTorch) and scientific programming in Python (including use of libraries such as Numpy), and specifically awareness of probabilistic views of inference, including Bayesian methods. Some relevant bioinformatics knowledge could make the topic more appreciable. The mode of work will be initially online, but travel and stay at the LIX laboratory will be a possibility to explore, Missing data is a universal problem in data science and machine learning, and impacts many domains. Imputation can be approached as a machine learning task itself, by building models and using them to predict the missing values. These models can predict single or multiple values, either within a common row-instance or across the dataset within a common feature-column. Deep neural networks are a popular and successful class of model in general, particularly when multiple outputs are involved, yet usually, these methods only provide point estimates. This proposed topic balances between the areas of machine learning and Bayesian inference. There are a number of methods found on this boundary that can be of interest, and among them, this project would target specifically Bayesian Neural Networks. We want not only to test and further develop this approach for missing values imputation (and specifically, in the domain of interest of SNP data), but also explore ways to leverage uncertainty information from the imputation task in a separate classification/regression task (using the imputed data). Although the main application domain is SNP datasets, methods can be tested in other domains including medical data (which we have available) and other tasks related to imputation, such as anomaly detection and recommendation systems. Full description and references (PDF, 117 Кб)
Eric Goubault Sylvie Putot Ecole Polytechnique, Institute Polytechnique de Paris Verification of neural network based controllers Верификация контроллеров на основе нейронных сетей Mostly master, but end bachelor for some parts of the project as well. No more than two. Ideally, some knowledge on verification, formal methods, set-based methods, basic control theory and basic AI (neural nets) Both remote and on-site supervision The student will look into one or more, of these subjects: - set-based abstractions of RELU activation functions, even swish, soft relu etc. (We already have sigmoids, tanh etc.) for inner and outer set approximation approaches, implemented in RINO https://github.com/cosynus-lix/RINO - an interpretation of network architectures more general than feedforward neural nets (with CNN blocks etc., at least an important fragment of ONNX) - abstractions for recurrent networks (initially, zonotopic interpretations and the use of our fixed point computation methods in the zonotopic domain see e.g. https://arxiv.org/abs/0910.1763) - a better implementation (in particular with an external and not internal representation) of the interpretation of RELU networks by tropical polyhedra, cf. http://www.lix.polytechnique.fr/Labo/Sylvie.Putot/Publications/sas21.pdf - extensions to this SAS21 paper (in particular, tropical polynomials, by simple techniques of linear representation on a monomial basis, improvement of the interpretation of multi-layer networks, e.g. by using intermediate zonotopes instead of hypercubes etc.) - develop examples of neural network controllers for RINO and various tools (NNV, ReachNN, Verisig, flowstar in particular) around the F1/10tenth cf. https://arxiv.org/pdf/1910.11309.pdf and https://github.com/rivapp/autonomous\_car\_verification