Armin Shmilovici - Academia.edu (original) (raw)

Papers by Armin Shmilovici

Research paper thumbnail of On the Application of Statistical Process Control to the Stochastic Complexity of Discrete Processes

International Conference of Computational Methods in Sciences and Engineering 2004 (ICCMSE 2004), 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Semantic Comparison of State-of-the-Art Deep Learning Methods for Image Multi-Label Classification

ArXiv, 2019

Image understanding relies heavily on accurate multi-label classification. In recent years deep l... more Image understanding relies heavily on accurate multi-label classification. In recent years deep learning (DL) algorithms have become very successful tools for multi-label classification of image objects. With these set of tools, various implementations of DL algorithms have been released for the public use in the form of application programming interfaces (API). In this study, we evaluate and compare 10 of the most prominent publicly available APIs in a best-of-breed challenge. The evaluation is performed on the Visual Genome labeling benchmark dataset using 12 well-recognized similarity metrics. In addition, for the first time in this kind of comparison, we use a semantic similarity metric to evaluate the semantic similarity performance of these APIs. In this evaluation, Microsoft's Computer Vision, TensorFlow, Imagga, and IBM's Visual Recognition showed better performance than the other APIs. Furthermore, the new semantic similarity metric allowed deeper insights for compa...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Comparison of state-of-the-art deep learning APIs for image multi-label classification using semantic metrics

Expert Systems with Applications, 2020

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Identifying turning points in animated cartoons

Expert Systems with Applications, 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Accelerating the relevance vector machine via data partitioning

Foundations of Computing and Decision Sciences, 2006

Bookmarks Related papers MentionsView impact

Research paper thumbnail of A VOM based gene-finder that specializes in short genes

We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM ... more We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM model is a generalization of the traditional Markov model; it is more efficient in terms of its parameterization, and, thus, can be trained on relatively short sequences. As a result, the proposed VOM gene-finder outperforms traditional gene-finders that are based on fifth-order Markov models for short newly sequenced bacterial genomes.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Fuzzy Systems

⎯ A data warehouse is a special database used for storing business oriented information for futur... more ⎯ A data warehouse is a special database used for storing business oriented information for future analysis and decision-making. In business scenarios, where some of the data or the business attributes are fuzzy, it may be useful to construct a warehouse that can support the analysis of fuzzy data. Here, we outline how Kimball‘s methodology for the design of a data warehouse can be extended to the construction of a fuzzy data warehouse. A case study demonstrates the viability of the methodology.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Towards Automatic Textual Summarization of Movies

Recent Developments and the New Direction in Soft-Computing Foundations and Applications, 2020

With the rapidly increasing number of online video resources, the ability of automatically unders... more With the rapidly increasing number of online video resources, the ability of automatically understanding those videos becomes more and more important, since it is almost impossible for people to watch all of the videos and provide textual descriptions. The duration of online videos varies in a extremely wide range, from several seconds to more than 5 h. In this paper, we focus on long videos, especially on full-length movies, and propose the first pipeline for automatically generating textual summaries of such movies. The proposed system takes an entire movie as input (including subtitles), splits it into scenes, generates a one-sentence description for each scene and summarizes those descriptions and subtitles into a final summary. In our initial experiment on a popular cinema movie (Forrest Gump), we utilize several existing algorithms and software tools for implementing the different components of our system. Most importantly, we use the S2VT (Sequence to Sequence—Video to Text) algorithm for scene description generation and MUSEEC (MUltilingual SEntence Extraction and Compression) for extractive text summarization. We present preliminary results from our prototype experimental framework. An evaluation of the resulting textual summaries for a movie made of 156 scenes demonstrates the feasibility of the approach—the summary contains the descriptions of three out of the four most important scenes/storylines in the movie. Although the summaries are far from satisfactory, we argue that the current results can be used to prove the merit of our approach.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Automatic generation of composite image descriptions

2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), 2017

Automatic generation of natural language descriptions for images has recently become an important... more Automatic generation of natural language descriptions for images has recently become an important research topic. In this paper, we propose a frame-based algorithm for generating a composite natural language description for a given image. The goal of this algorithm is to describe not only the objects appearing in the image but also the main activities happening in the image and the objects participating in those activities. The algorithm builds upon a pre-trained CRF (Conditional Random Field)-based structured prediction model, which generates a set of alternative frames for a given image. We use imSitu, a situation recognition dataset with 126,102 images, 504 activities, 11,538 objects, and 1,788 roles, as a test bed of our algorithm. We ask human evaluators to evaluate the quality of the descriptions for 20 images from the imSitu dataset. The results demonstrate that our composite description contains on average 16% more visual elements than the baseline method and gains a significantly higher accuracy score by the human evaluators.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of A Feature Selection Strategy for the Relevance Vector Machine

The Relevance Vector Machine (RVM) is a generalized linear model that can use kernel functions as... more The Relevance Vector Machine (RVM) is a generalized linear model that can use kernel functions as basis functions. The typical RVM solution is very sparse. We present a strategy for feature ranking and selection via evaluating the influence of the features on the relevance vectors. This requires a single training of the RVM, thus, it is very efficient. Experiments on a benchmark regression problem provide evidence that it selects high-quality feature sets at a fraction of the costs of classical methods. Key-Words: Feature Selection, Relevance Vector Machine, Machine Learning

Bookmarks Related papers MentionsView impact

Research paper thumbnail of On the Application of Statistical Process Control to the Stochastic Complexity of Discrete Processes

International Conference of Computational Methods in Sciences and Engineering 2004 (ICCMSE 2004), 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Semantic Comparison of State-of-the-Art Deep Learning Methods for Image Multi-Label Classification

ArXiv, 2019

Image understanding relies heavily on accurate multi-label classification. In recent years deep l... more Image understanding relies heavily on accurate multi-label classification. In recent years deep learning (DL) algorithms have become very successful tools for multi-label classification of image objects. With these set of tools, various implementations of DL algorithms have been released for the public use in the form of application programming interfaces (API). In this study, we evaluate and compare 10 of the most prominent publicly available APIs in a best-of-breed challenge. The evaluation is performed on the Visual Genome labeling benchmark dataset using 12 well-recognized similarity metrics. In addition, for the first time in this kind of comparison, we use a semantic similarity metric to evaluate the semantic similarity performance of these APIs. In this evaluation, Microsoft's Computer Vision, TensorFlow, Imagga, and IBM's Visual Recognition showed better performance than the other APIs. Furthermore, the new semantic similarity metric allowed deeper insights for compa...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Comparison of state-of-the-art deep learning APIs for image multi-label classification using semantic metrics

Expert Systems with Applications, 2020

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Identifying turning points in animated cartoons

Expert Systems with Applications, 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Accelerating the relevance vector machine via data partitioning

Foundations of Computing and Decision Sciences, 2006

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Do Famous People Live Longer? A Wikipedia Analysis

Occupational factors have long been linked to patterns of mortality. Based on the premise that an... more Occupational factors have long been linked to patterns of mortality. Based on the premise that an entry in an encyclopedia tends to imply success in one's vocation, we used Wikipedia biographical entries (English version) to elucidate the relationship between career success and longevity. Analyzing 7756 Wikipedia entries for persons deceased between 2009 and 2011 in terms of gender, occupation and longevity, we found that male entries outnumbered female (6548 vs. 1208), and the mean age of death was lower for males than females (76.31 vs. 78.50 years). Younger ages of death were evident among sports players and Performing artists (73.04) and creative workers (74.68). Older deaths were seen in professionals and academics (82.63). Since these results are comparable with those found in the literature, they validate the use of Wikipedia for population studies. The gender classification procedure we developed for the biographical entries in order to obtain an occupation-by-gender com...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of A VOM based gene-finder that specializes in short genes

We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM ... more We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM model is a generalization of the traditional Markov model; it is more efficient in terms of its parameterization, and, thus, can be trained on relatively short sequences. As a result, the proposed VOM gene-finder outperforms traditional gene-finders that are based on fifth-order Markov models for short newly sequenced bacterial genomes.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Context dependent ARMA modeling

21st IEEE Convention of the Electrical and Electronic Engineers in Israel. Proceedings (Cat. No.00EX377), 2000

ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of th... more ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of the partial hidden Markov model-to continuous signals. While the original algorithm is dedicated to modeling the context in which each symbol can occur in a discrete symbol space, we propose to match a specific ARMA model with each identified context. An example is presented

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Do Famous People Live Longer? A Wikipedia Analysis

Occupational factors have long been linked to patterns of mortality. Based on the premise that an... more Occupational factors have long been linked to patterns of mortality. Based on the premise that an entry in an encyclopedia tends to imply success in one's vocation, we used Wikipedia biographical entries (English version) to elucidate the relationship between career success and longevity. Analyzing 7756 Wikipedia entries for persons deceased between 2009 and 2011 in terms of gender, occupation and longevity, we found that male entries outnumbered female (6548 vs. 1208), and the mean age of death was lower for males than females (76.31 vs. 78.50 years). Younger ages of death were evident among sports players and Performing artists (73.04) and creative workers (74.68). Older deaths were seen in professionals and academics (82.63). Since these results are comparable with those found in the literature, they validate the use of Wikipedia for population studies. The gender classification procedure we developed for the biographical entries in order to obtain an occupation-by-gender com...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Context dependent ARMA modeling

21st IEEE Convention of the Electrical and Electronic Engineers in Israel. Proceedings (Cat. No.00EX377), 2000

ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of th... more ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of the partial hidden Markov model-to continuous signals. While the original algorithm is dedicated to modeling the context in which each symbol can occur in a discrete symbol space, we propose to match a specific ARMA model with each identified context. An example is presented

Bookmarks Related papers MentionsView impact

Research paper thumbnail of On the Application of Statistical Process Control to the Stochastic Complexity of Discrete Processes

International Conference of Computational Methods in Sciences and Engineering 2004 (ICCMSE 2004), 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Semantic Comparison of State-of-the-Art Deep Learning Methods for Image Multi-Label Classification

ArXiv, 2019

Image understanding relies heavily on accurate multi-label classification. In recent years deep l... more Image understanding relies heavily on accurate multi-label classification. In recent years deep learning (DL) algorithms have become very successful tools for multi-label classification of image objects. With these set of tools, various implementations of DL algorithms have been released for the public use in the form of application programming interfaces (API). In this study, we evaluate and compare 10 of the most prominent publicly available APIs in a best-of-breed challenge. The evaluation is performed on the Visual Genome labeling benchmark dataset using 12 well-recognized similarity metrics. In addition, for the first time in this kind of comparison, we use a semantic similarity metric to evaluate the semantic similarity performance of these APIs. In this evaluation, Microsoft's Computer Vision, TensorFlow, Imagga, and IBM's Visual Recognition showed better performance than the other APIs. Furthermore, the new semantic similarity metric allowed deeper insights for compa...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Comparison of state-of-the-art deep learning APIs for image multi-label classification using semantic metrics

Expert Systems with Applications, 2020

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Identifying turning points in animated cartoons

Expert Systems with Applications, 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Accelerating the relevance vector machine via data partitioning

Foundations of Computing and Decision Sciences, 2006

Bookmarks Related papers MentionsView impact

Research paper thumbnail of A VOM based gene-finder that specializes in short genes

We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM ... more We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM model is a generalization of the traditional Markov model; it is more efficient in terms of its parameterization, and, thus, can be trained on relatively short sequences. As a result, the proposed VOM gene-finder outperforms traditional gene-finders that are based on fifth-order Markov models for short newly sequenced bacterial genomes.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Fuzzy Systems

⎯ A data warehouse is a special database used for storing business oriented information for futur... more ⎯ A data warehouse is a special database used for storing business oriented information for future analysis and decision-making. In business scenarios, where some of the data or the business attributes are fuzzy, it may be useful to construct a warehouse that can support the analysis of fuzzy data. Here, we outline how Kimball‘s methodology for the design of a data warehouse can be extended to the construction of a fuzzy data warehouse. A case study demonstrates the viability of the methodology.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Towards Automatic Textual Summarization of Movies

Recent Developments and the New Direction in Soft-Computing Foundations and Applications, 2020

With the rapidly increasing number of online video resources, the ability of automatically unders... more With the rapidly increasing number of online video resources, the ability of automatically understanding those videos becomes more and more important, since it is almost impossible for people to watch all of the videos and provide textual descriptions. The duration of online videos varies in a extremely wide range, from several seconds to more than 5 h. In this paper, we focus on long videos, especially on full-length movies, and propose the first pipeline for automatically generating textual summaries of such movies. The proposed system takes an entire movie as input (including subtitles), splits it into scenes, generates a one-sentence description for each scene and summarizes those descriptions and subtitles into a final summary. In our initial experiment on a popular cinema movie (Forrest Gump), we utilize several existing algorithms and software tools for implementing the different components of our system. Most importantly, we use the S2VT (Sequence to Sequence—Video to Text) algorithm for scene description generation and MUSEEC (MUltilingual SEntence Extraction and Compression) for extractive text summarization. We present preliminary results from our prototype experimental framework. An evaluation of the resulting textual summaries for a movie made of 156 scenes demonstrates the feasibility of the approach—the summary contains the descriptions of three out of the four most important scenes/storylines in the movie. Although the summaries are far from satisfactory, we argue that the current results can be used to prove the merit of our approach.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Automatic generation of composite image descriptions

2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), 2017

Automatic generation of natural language descriptions for images has recently become an important... more Automatic generation of natural language descriptions for images has recently become an important research topic. In this paper, we propose a frame-based algorithm for generating a composite natural language description for a given image. The goal of this algorithm is to describe not only the objects appearing in the image but also the main activities happening in the image and the objects participating in those activities. The algorithm builds upon a pre-trained CRF (Conditional Random Field)-based structured prediction model, which generates a set of alternative frames for a given image. We use imSitu, a situation recognition dataset with 126,102 images, 504 activities, 11,538 objects, and 1,788 roles, as a test bed of our algorithm. We ask human evaluators to evaluate the quality of the descriptions for 20 images from the imSitu dataset. The results demonstrate that our composite description contains on average 16% more visual elements than the baseline method and gains a significantly higher accuracy score by the human evaluators.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of A Feature Selection Strategy for the Relevance Vector Machine

The Relevance Vector Machine (RVM) is a generalized linear model that can use kernel functions as... more The Relevance Vector Machine (RVM) is a generalized linear model that can use kernel functions as basis functions. The typical RVM solution is very sparse. We present a strategy for feature ranking and selection via evaluating the influence of the features on the relevance vectors. This requires a single training of the RVM, thus, it is very efficient. Experiments on a benchmark regression problem provide evidence that it selects high-quality feature sets at a fraction of the costs of classical methods. Key-Words: Feature Selection, Relevance Vector Machine, Machine Learning

Bookmarks Related papers MentionsView impact

Research paper thumbnail of On the Application of Statistical Process Control to the Stochastic Complexity of Discrete Processes

International Conference of Computational Methods in Sciences and Engineering 2004 (ICCMSE 2004), 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Semantic Comparison of State-of-the-Art Deep Learning Methods for Image Multi-Label Classification

ArXiv, 2019

Image understanding relies heavily on accurate multi-label classification. In recent years deep l... more Image understanding relies heavily on accurate multi-label classification. In recent years deep learning (DL) algorithms have become very successful tools for multi-label classification of image objects. With these set of tools, various implementations of DL algorithms have been released for the public use in the form of application programming interfaces (API). In this study, we evaluate and compare 10 of the most prominent publicly available APIs in a best-of-breed challenge. The evaluation is performed on the Visual Genome labeling benchmark dataset using 12 well-recognized similarity metrics. In addition, for the first time in this kind of comparison, we use a semantic similarity metric to evaluate the semantic similarity performance of these APIs. In this evaluation, Microsoft's Computer Vision, TensorFlow, Imagga, and IBM's Visual Recognition showed better performance than the other APIs. Furthermore, the new semantic similarity metric allowed deeper insights for compa...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Comparison of state-of-the-art deep learning APIs for image multi-label classification using semantic metrics

Expert Systems with Applications, 2020

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Identifying turning points in animated cartoons

Expert Systems with Applications, 2019

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Accelerating the relevance vector machine via data partitioning

Foundations of Computing and Decision Sciences, 2006

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Do Famous People Live Longer? A Wikipedia Analysis

Occupational factors have long been linked to patterns of mortality. Based on the premise that an... more Occupational factors have long been linked to patterns of mortality. Based on the premise that an entry in an encyclopedia tends to imply success in one's vocation, we used Wikipedia biographical entries (English version) to elucidate the relationship between career success and longevity. Analyzing 7756 Wikipedia entries for persons deceased between 2009 and 2011 in terms of gender, occupation and longevity, we found that male entries outnumbered female (6548 vs. 1208), and the mean age of death was lower for males than females (76.31 vs. 78.50 years). Younger ages of death were evident among sports players and Performing artists (73.04) and creative workers (74.68). Older deaths were seen in professionals and academics (82.63). Since these results are comparable with those found in the literature, they validate the use of Wikipedia for population studies. The gender classification procedure we developed for the biographical entries in order to obtain an occupation-by-gender com...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of A VOM based gene-finder that specializes in short genes

We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM ... more We present a new approach for gene finding based on a variable-order Markov (VOM) model. The VOM model is a generalization of the traditional Markov model; it is more efficient in terms of its parameterization, and, thus, can be trained on relatively short sequences. As a result, the proposed VOM gene-finder outperforms traditional gene-finders that are based on fifth-order Markov models for short newly sequenced bacterial genomes.

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Context dependent ARMA modeling

21st IEEE Convention of the Electrical and Electronic Engineers in Israel. Proceedings (Cat. No.00EX377), 2000

ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of th... more ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of the partial hidden Markov model-to continuous signals. While the original algorithm is dedicated to modeling the context in which each symbol can occur in a discrete symbol space, we propose to match a specific ARMA model with each identified context. An example is presented

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Do Famous People Live Longer? A Wikipedia Analysis

Occupational factors have long been linked to patterns of mortality. Based on the premise that an... more Occupational factors have long been linked to patterns of mortality. Based on the premise that an entry in an encyclopedia tends to imply success in one's vocation, we used Wikipedia biographical entries (English version) to elucidate the relationship between career success and longevity. Analyzing 7756 Wikipedia entries for persons deceased between 2009 and 2011 in terms of gender, occupation and longevity, we found that male entries outnumbered female (6548 vs. 1208), and the mean age of death was lower for males than females (76.31 vs. 78.50 years). Younger ages of death were evident among sports players and Performing artists (73.04) and creative workers (74.68). Older deaths were seen in professionals and academics (82.63). Since these results are comparable with those found in the literature, they validate the use of Wikipedia for population studies. The gender classification procedure we developed for the biographical entries in order to obtain an occupation-by-gender com...

Bookmarks Related papers MentionsView impact

Research paper thumbnail of Context dependent ARMA modeling

21st IEEE Convention of the Electrical and Electronic Engineers in Israel. Proceedings (Cat. No.00EX377), 2000

ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of th... more ABSTRACT We propose to extend the use of Risannen's (1983) “tree source”-a relative of the partial hidden Markov model-to continuous signals. While the original algorithm is dedicated to modeling the context in which each symbol can occur in a discrete symbol space, we propose to match a specific ARMA model with each identified context. An example is presented

Bookmarks Related papers MentionsView impact