Antonio Tadeu Gomes - Profile on Academia.edu (original) (raw)

Papers by Antonio Tadeu Gomes

This work aims at implementing and evaluating metascheduling policies that employ little or no in... more This work aims at implementing and evaluating metascheduling policies that employ little or no information about the resources in the infrastructure they act upon. We have conducted some experiments with such policies in a simulation setting that models the one found on the Brazilian SINAPAD network. As expected, these experiments have shown that policies which employ promptly available resource information-such as CPU load-lead to better scheduling decisions than randomly selecting resources. Among the evaluated policies, the one with better performance was based on the history of previous job executions and on the expected CPU usage. Such policy, on the other hand, was the one with the largest decision time. Even so, we have identified a trend in the time spent by this policy for the decision making phase to be mitigated by the gains in the overall execution time of jobs scheduled by this policy. Grades computacionais têm se destacado ao longo dos anos entre as infraestruturas utilizadas para o processamento de diversas aplicac ¸ões científicas Trefethen 2003,Emmott 2005]. Nessas infraestruturas, middlewares de gerenciamento escondem a natureza altamente dinâmica e heterogênea dos recursos e, com frequência, implementam

Bioinformatics experiments are rapidly and constantly evolving due improvements in sequencing tec... more Bioinformatics experiments are rapidly and constantly evolving due improvements in sequencing technologies. These experiments usually demand high performance computation and produce huge quantities of data. They also require different programs to be executed in a certain order, allowing the experiments to be modeled as workflows. However, users do not always have the infrastructure needed to perform these experiments. Our contribution is the integration of scientific workflow management systems and grid-enabled scientific gateways, providing the user with a transparent way to run these workflows in geographically distributed computing resources. The availability of the workflow through the gateway allows for a better usability of these experiments.

Anais da VIII Escola Regional de Alto Desempenho do Rio de Janeiro (ERAD-RJ 2023)

Este trabalho apresenta um resultado de execução paralela no supercomputador Santos Dumont para u... more Este trabalho apresenta um resultado de execução paralela no supercomputador Santos Dumont para uma implementação do método CSEM 3D. Conceitos e métodos de implementação de processamento paralelo foram empregados para enfatizar o uso de recursos computacionais em cada nó. Assim, pode-se verificar que existe uma limitação na melhoria de desempenho ao aumentar o número de núcleos computacionais utilizados na arquitetura de CPU multicore disponível.

In this paper, we present MSLIO, a code to mimic the I/O behavior of multiscale simulations. Such... more In this paper, we present MSLIO, a code to mimic the I/O behavior of multiscale simulations. Such an I/O kernel is useful for HPC research, as it can be executed more easily and more efficiently than the full simulations when researchers are interested in the I/O load only. We validate MSLIO by comparing it to the I/O performance of an actual simulation, and we then use it to test some possible improvements to the output routine of the MHM (Multiscale Hybrid Mixed) library.

arXiv (Cornell University), Jul 15, 2022

Physics-Informed Neural Networks (PINNs) are machine learning tools that approximate the solution... more Physics-Informed Neural Networks (PINNs) are machine learning tools that approximate the solution of general partial differential equations (PDEs) by adding them in some form as terms of the loss/cost function of a Neural Network. Most pieces of work in the area of PINNs tackle non-linear PDEs. Nevertheless, many interesting problems involving linear PDEs may benefit from PINNs; these include parametric studies, multi-query problems, and parabolic (transient) PDEs. The purpose of this paper is to explore PINNs for linear PDEs whose solutions may present one or more boundary layers. More specifically, we analyze the steady-state reaction-advection-diffusion equation in regimes in which the diffusive coefficient is small in comparison with the reactive or advective coefficients. We show that adding information about these coefficients as predictor variables in a PINN results in better prediction models than a PINN that only uses spatial information as predictor variables. This finding may be instrumental in multiscale problems where the coefficients of the PDEs present high variability in small spatiotemporal regions of the domain, and therefore PINNs may be employed together with domain decomposition techniques to efficiently approximate the PDEs locally at each partition of the spatiotemporal domain, without resorting to different learned PINN models at each of these partitions.

HAL (Le Centre pour la Communication Scientifique Directe), Sep 9, 2020

The multiscale hybrid-mixed (MHM) method consists of a multi-level strategy to approximate the so... more The multiscale hybrid-mixed (MHM) method consists of a multi-level strategy to approximate the solution of boundary value problems with heterogeneous coefficients. In this context, we propose a new family of finite elements for the linear elasticity equation defined on coarse polytopal partitions of the domain. The finite elements rely on face degrees of freedom associated with multiscale bases obtained from local Neumann problems with polynomial interpolations on faces. We establish sufficient conditions on the fine-scale interpolations such that the MHM method is well-posed. Also, discrete traction stays in local equilibrium with external forces. We show by means of a multi-level analysis that the MHM method achieves optimal convergence under local regularity conditions without refining the coarse partition. The upshot is that the Poincaré and Korn's inequalities do not degenerate, and then convergence arises on general meshes. We employ two-and three-dimensional numerical tests to assess theoretical results and to verify the robustness of the method through a multi-layer media case. Also, we address computational aspects of the underlying parallel algorithm associated with different configurations of the MHM method; our aim is to find the best compromise between execution time and memory allocation to achieve a given error threshold.

arXiv (Cornell University), Aug 4, 2011

The analysis of large-scale complex networks is a major challenge in the Big Data domain. Given t... more The analysis of large-scale complex networks is a major challenge in the Big Data domain. Given the large-scale of the complex networks researchers commonly deal with nowadays, the use of localized information (i.e. restricted to a limited neighborhood around each node of the network) for centrality-based analysis is gaining momentum in the recent literature. In this context, we propose a framework for the Distributed Assessment of Network Centralities (DANCE) in complex networks. DANCE offers a single environment that allows the use of different localized centrality proposals, which can be tailored to specific applications. This environment can be thus useful given the vast potential applicability of centrality-based analysis on large-scale complex networks found in different areas, such as Biology, Physics, Sociology, or Computer Science. Since the localized centrality proposals DANCE implements employ only localized information, DANCE can easily benefit from parallel processing environments and run on different computing architectures. To illustrate this, we present a parallel implementation of DANCE and show how it can be applied to the analysis of large-scale complex networks using different kinds of network centralities. This implementation is made available to complex network researchers and practitioners interested in using it through a scientific web portal.

HAL (Le Centre pour la Communication Scientifique Directe), Dec 21, 2022

In this work we propose, analyze, and test a new multiscale finite element method called Multisca... more In this work we propose, analyze, and test a new multiscale finite element method called Multiscale Hybrid (MH) method. The method is built as a close relative to the Multiscale Hybrid Mixed (MHM) method, but with the fundamental difference that a novel definition of the Lagrange multiplier is introduced. The practical implication of this is that both the local problems to compute the basis functions, as well as the global problem, are elliptic, as opposed to the MHM method (and also other previous methods) where a mixed global problem is solved, and constrained local problems are solved to compute the local basis functions. The error analysis of the method is based on a hybrid formulation, and a static condensation process is done at the discrete level, so the final global system only involves the Lagrange multipliers. We tested the performance of the method by means of numerical experiments for problems with multiscale coefficients, and we carried out comparisons with the MHM method in terms of performance, accuracy, and memory requirements.

arXiv (Cornell University), Jun 4, 2022

The sequence of visits and procedures performed by the patient in the health system, also known a... more The sequence of visits and procedures performed by the patient in the health system, also known as the patient's pathway or trajectory, can reveal important information about the clinical treatment adopted and the health service provided. The rise of electronic health data availability made it possible to assess the pathways of a large number of patients. Nevertheless, some challenges also arose concerning how to synthesise these pathways and how to mine them from the data, fostering a new field of research. The objective of this review is to survey this new field of research, highlighting representation models, mining techniques, methods of analysis and examples of case studies.

2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)

Science gateways bring out the possibility of reproducible science as they are integrated into re... more Science gateways bring out the possibility of reproducible science as they are integrated into reusable techniques, data and workflow management systems, security mechanisms, and high performance computing (HPC). We introduce BioinfoPortal, a science gateway that integrates a suite of different bioinformatics applications using HPC and data management resources provided by the Brazilian National HPC System (SINAPAD). BioinfoPortal follows the Software as a Service (SaaS) model and the web server is freely available for academic use. The goal of this paper is to describe the science gateway and its usage, addressing challenges of designing a multiuser computational platform for parallel/distributed executions of large-scale bioinformatics applications using the Brazilian HPC resources. We also present a study of performance and scalability of some bioinformatics applications executed in the HPC environments and perform machine learning analyses for predicting features for the HPC allocation/usage that could better perform the bioinformatics applications via BioinfoPortal. Keywords-science gateway, bioinformatics, high performance computing I. INTRODUCTION Nowadays, genomics research shows an unprecedented effort in sequencing and categorizing genomes produced by new-generation high-throughput DNA sequencing [1]. The capacity for the biological data generation has led to an explosive growth of the complexity, heterogeneity, volume, and geographic dispersion of this biological "big data" [2]. Considering the annual growth of the generated data, it is estimated that the biological big data will reach 44 zettabytes in 2020 [3]. Thus, analyzing this volume of data is far from trivial. The integration of the latest breakthroughs in biomedical technology from one side and High Performance Computing (HPC), Scientific Workflow Management Systems (SWfMS) [4], and Database Management Systems (DBMS) [5] from another side, enables remarkable advances in the fields of healthcare, drug discovery, genome research,

Fourth Annual IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOMW'06)

After an Acute Myocardial Infarction (AMI), the sooner the patient is approached, the greater are... more After an Acute Myocardial Infarction (AMI), the sooner the patient is approached, the greater are the chances that pharmacological therapy (using thrombolytics) be more effective than surgical intervention. Nevertheless, the thrombolytic therapy may have hazard effects on AMI patients that present any contraindication to it. As a consequence, paramedics usually hesitate about applying the thrombolytic therapy-preferring to immediately transfer patients to coronary care units (CCUs)-unless cardiologists support their decision. To cope with this scenario, we envision a ubiquitous telemedicine system for supporting cardiologists and paramedics in (i) the remote decision upon the eligibility of AMI patients to the thrombolytic therapy and (ii) the remote monitoring of patients being transferred. socalled In this paper, we present AToMS (AMI Teleconsultation & Monitoring), a system that makes extensive use of (possibly heterogeneous) wireless communication technology to allow its use by a paramedic at the location where the AMI patient is first assisted, thus reducing the delay between the onset of symptoms and the eventual application of proper treatment. All exchanged messages among paramedics and cardiologists are recorded, thus rendering a fully auditable system.

Evolving Systems, 2022

Flight delays impose challenges that impact any flight transportation system. Predicting when the... more Flight delays impose challenges that impact any flight transportation system. Predicting when they are going to occur is an important way to mitigate this issue. However, the behavior of the flight delay system varies through time. This phenomenon is known in predictive analytics as concept drift. This paper investigates the prediction performance of different drift handling strategies in aviation under different scales (models trained from flights related to a single airport or the entire flight system). Specifically, two research questions were proposed and answered: (i) How do drift handling strategies influence the prediction performance of delays? (ii) Do different scales change the results of drift handling strategies? In our analysis, drift handling strategies are relevant, and their impacts vary according to scale and machine learning models used.

ArXiv, 2017

The family of Multiscale Hybrid-Mixed (MHM) finite element methods has received considerable atte... more The family of Multiscale Hybrid-Mixed (MHM) finite element methods has received considerable attention from the mathematics and engineering community in the last few years. The MHM methods allow solving highly heterogeneous problems on coarse meshes while providing solutions with high-order precision. It embeds independent local problems which are responsible for upscaling unresolved scales into the numerical solution. These local contributions are brought together through a global problem defined on the skeleton of the coarse partition. Since the local problems are completely independent, they can be easily computed in parallel. In this paper, we present two simulator prototypes specifically crafted for the MHM methods, which adopt two different implementation strategies: (i) a multi-programming language approach, each language tackling different simulation issues; and (ii) a classical, single-programming language approach. Specifically, we use C++ for numerical computation of the ...

Concurrency and Computation: Practice and Experience, 2019

SummaryWorkload‐aware loop schedulers were introduced to deliver better performance than classica... more SummaryWorkload‐aware loop schedulers were introduced to deliver better performance than classical loop scheduling strategies. However, they presented limitations such as inflexible built‐in workload estimators and suboptimal chunk scheduling. Targeting these challenges, we proposed previously a workload‐aware scheduling strategy called BinLPT, which relies on three features: (i) user‐supplied estimations of the workload of the loop; (ii) a greedy heuristic that adaptively partitions the iteration space in several chunks; and (iii) a scheduling scheme based on the Longest Processing Time (LPT) rule and on‐demand technique. In this paper, we present two new contributions to the state‐of‐the‐art. First, we introduce a multiloop support feature to BinLPT, which enables the reuse of estimations across loops. Based on this feature, we integrated BinLPT into a real‐world elastodynamics application, and we evaluated it running on a supercomputer. Second, we present an evaluation of BinLPT ...

ICC 2001. IEEE International Conference on Communications. Conference Record (Cat. No.01CH37240)

This paper proposes an approach for representing and programming QoS functions in communication s... more This paper proposes an approach for representing and programming QoS functions in communication systems. It presents a model that gives adequate support for defining: (i) communication environments and their adaptation mechanisms, and (ii) frameworks that delineate QoS-specific abstractions that appear within any communication environment.

puc-rio.br Resumo. A diversidade de configurações possíveis dos modelos intserv e diffserv e das ... more puc-rio.br Resumo. A diversidade de configurações possíveis dos modelos intserv e diffserv e das várias modalidades de provisão de QoS no nível das sub-redes torna difícil a compreensão de que tipo de modelo e tecnologia de subrede deve ser utilizado para se realizar um determinado serviço. Adicionalmente, a contínua evolução tecnológica sugere o desenvolvimento de arquiteturas flexíveis o suficiente para acomodar, em tempo de operação, adaptações que hoje só são possíveis por meio de procedimentos menos dinâmicos como atualização de hardware ou firmware. Este trabalho propõe uma arquitetura adaptável para provisão de QoS na Internet que é independente do modelo de serviço e dos mecanismos de provisão, incluindo a tecnologia de sub-rede empregada pelo provedor de serviços. Mostra-se como, a partir da definição de um framework genérico, pode-se especializar pontos de flexibilização para implementar esses dois modelos utilizados para prover serviços com QoS na Internet. Em seguida, é proposta uma arquitetura geral que permite que as funções de provisão de QoS em estações e roteadores passem a ser representadas independente dos modelos de serviços e sub-redes existentes.

Journal of Communication and Information Systems, 2003

The increasing demand for distributed multimedia applications makes evident the need for end-to-e... more The increasing demand for distributed multimedia applications makes evident the need for end-to-end quality of service (QoS) provisioning. Pmticularly, operating systems, despite their location at end systems, switches or routers, must guarantee that resources under their control are adequately managed to fulfill the application requirements. This work proposes an architecture for adaptive QoS provisioning on network operating systems (QoSOS), focusing mainly on the packet queuing subsystem. The development of such architecture came after an analysis of solutions currently found in the literature and the perception of their functional similarities. QoSOS allows the reuse of common functions and the definition of an internal organization that is equivalent in different systems. In order to demonstrate how QoSOS can be applied in a real QoS provisioning scenario, the paper describes the modeling and implementation of an adaptable Intserv support, focusing on the management of the output queues of the Linux operating system. The architecture instantiation is based on few modifications introduced into the standard Linux kemel, that adds some desirable features such as runtime service adaptation.