Alejandro Bia | Universidad Miguel Hernandez (original) (raw)

Papers by Alejandro Bia

Research paper thumbnail of Metadata requirements and solutions at the Digital Library of the National Library of Spain

This article describes the requirements and technological solutions adopted by the National Libra... more This article describes the requirements and technological solutions adopted by the National Library of Spain for its Digital Library section concerning metadata. It also discusses different approaches for metadata handling in general. ♣

Research paper thumbnail of Exploring the teacher asresearcher synergy through active learning methodologies

Research paper thumbnail of Developing a simple production and dissemination model for a complex DL-news service using XML, XSLT and TEI ♣

Electronic publishing makes it possible to reach every corner of the world and opens up new resea... more Electronic publishing makes it possible to reach every corner of the world and opens up new research and communication paths. In this article we describe the production model and implementation of an electronic news service for a DL, that manages altogether five different DL-newsletters plus a monthly journal, each one of them delivered in several optional output formats, and all managed and produced based on XML-TEI and XSLT technology. Articles and news may appear in different newsletters and/or the journal, and they have different periodicity (some are published quarterly and some monthly). The news and articles come from different sources, which generally coincide with departments or units of the DL. A general editor reviews the articles or news, decides where must they appear, and also manages the distribution lists. The newsletters are not preserved, but the most relevant news and articles are published in the monthly journal called Dulcinea, which is published through Interne...

Research paper thumbnail of Automating the production of facsimiles and transcriptions for manuscripts and rare old books at the Miguel de Cervantes Digital Library

Research paper thumbnail of Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

The largest effort in the area of standardisation of computer encoding of language resources has ... more The largest effort in the area of standardisation of computer encoding of language resources has been the Text Encoding Initiative (TEI), established in 1987. TEI chose as its underlying standard SGML (Standard Generalized Markup Language), and in the years before the inception of XML, a number of projects encoded their data according to some SGML DTD, TEI compliant, or otherwise. These projects could now benefit from migrating their data to XML. Apart from validation, the most compelling reason for migration is the scarcity of SGML-aware software and the abundance of XML-based tools and related recommendations. However, despite the fact that XML is a subset of SGML, migration is not a trivial process, especially in the case of large holdings of legacy language resources. This is why in 2002 the TEI Consortium established a Task Force on SGML to XML migration. The TF has now produced a number of reports that simplify and make explicit the conversion of SGML TEI (version P3) to XML T...

Research paper thumbnail of Customizing user access to the Miguel de Cervantes Digital Library

According to [1] Internet is a market with many micromarkets, based on needs, interests and trend... more According to [1] Internet is a market with many micromarkets, based on needs, interests and trends, both personal and professional. Each and every space of the net is atomized to reach the users, with their own preferences and behaviors. Our DL project intends to give the users a customized view where they could receive personalized information. We describe the goals and design ideas followed in our customization project.

Research paper thumbnail of Desarrollo de una política de preservación digital: tecnología

Research paper thumbnail of Information Retrieval in Digital Libraries : efficient catalog searches using tries

Digital Libraries of literary works usually store a huge amount of textual information. It is obv... more Digital Libraries of literary works usually store a huge amount of textual information. It is obvious that the mere accumulation of texts leads only to a limited-use library. Hence the need for eÆcient information retrieval services. The use of indices to speed up the search is advisable in cases like ours, the \Miguel de Cervantes" digital library, where the text collection is relatively large (4000 books at present) and semi-static (updated at reasonably large intervals). We developed a catalog search engine based on tries that performs fast searches with eÆcient results. However the size of the trie structures is quite big, proving adequate for catalog searches, but not for whole text indexing.

Research paper thumbnail of El Método "Software Factory": acciones para realizar prácticas más realistas, usando herramientasWeb de trabajo colaborativo, y trabajo a distancia

El metodo Software Factory (SF) consiste en llevar a cabo acciones y aplicar herramientas informa... more El metodo Software Factory (SF) consiste en llevar a cabo acciones y aplicar herramientas informaticas, con dos finalidades: (1) hacer que las practicas de asignaturas tecnologicas (en un sentido amplio) sean lo mas parecidas que sea posible al trabajo real en la empresa o la industria y (2) difundir el uso de herramientas Web que permitan nuevas formas de trabajo agiles y dinamicas, como por ejemplo el teletrabajo, el trabajo movil y el trabajo colaborativo a distancia. Este articulo describe las experiencias de los autores en este sentido.

Research paper thumbnail of Using the MatchDetectReveal System for Comparative Analysis of Texts

In this paper we are introducing the MatchDetectReveal system, which is capable of identifying th... more In this paper we are introducing the MatchDetectReveal system, which is capable of identifying the similarity between documents. Different applications of the system are discussed including cross-referencing multiple editions of literary works, plagiarism detection, organizing collections of documents and comparative analysis of texts. The system uses suffix trees and suffix vectors for comparing documents. These data structures are very fast and powerful, which allows fast comparison of documents. The front-end of the system is fully Web-based, thus users only need to use a Web browser to access the system. The results are also presented as HTML files utilising the hyperlink capabilities of HTML documents.

Research paper thumbnail of Automating the internal workflow of the Miguel de Cervantes Digital Library

We describe the digital-book-production flow of the Miguel de Cervantes Virtual Library, from boo... more We describe the digital-book-production flow of the Miguel de Cervantes Virtual Library, from book acquisition up to Internet publishing, highlighting the main requirements and design considerations of the workflow system.

Research paper thumbnail of TBL aplicado a la Ingeniería de Software

Research paper thumbnail of A reusable hypermedia design for DL manuscripts , based on XML , XSLT and Java

The purpose of this article is to describe our approach to the massive production of facsimile-ty... more The purpose of this article is to describe our approach to the massive production of facsimile-type hypertext books that contain digital images of manuscripts and old printings to be published on the Internet as one of our DL services . The goal of this project is to offer an easy-to-use interface that allows customizable views of facsimile images of book pages in several sizes and formats with transcriptions that are offered in three forms: Madison style, normal, and modernized. We explain the hypertext design, and the time-saving production methodology we use.

Research paper thumbnail of Information Extraction to feed Digital Library Databases

Most often, Digital Libraries have the need to extract information from poorly marked-up document... more Most often, Digital Libraries have the need to extract information from poorly marked-up documents to fill databases or create new hypertext documents with a highly structured markup. In this work, we approach the problem of extracting bibliographic information from literary reports in HTML format to fill a Digital Library database of Galician publications used for Internet searchs. An information extraction approach that takes advantage of both HTML markup and Natural Language Processing (NLP) techniques was successfully used for this purpose.

Research paper thumbnail of Deploying Team-Based Learning at Undergraduate Software Engineering Courses

2017 IEEE/ACM 1st International Workshop on Software Engineering Curricula for Millennials (SECM)

Education methods for millennials must accommodate their expectations and behaviors. Active learn... more Education methods for millennials must accommodate their expectations and behaviors. Active learning methodologies seem to be adequate for this requirement. In particular, in this paper, we discuss the design and deployment of Team-Based Learning (TBL) in two undergraduate Software Engineering courses. TBL is a type of Active Learning Methodology that makes extensive use of small groups to accommodate learning and empower students with the learning responsibilities in the classroom. This paper describes our concerns and the decisions we made when designing two TBL courses at ORT University. Furthermore, we evaluated the results of our deployment and compared them with published results. Our results are aligned with the expectations inferred from the literature. Students had a positive perception of the methodology and the learning outcomes.

Research paper thumbnail of Promoting the Use of Technical Standards Inindustry-Oriented Engineering Education

Педагогически форум

The role of technical standards (TS) has become increasingly important for engineering over the p... more The role of technical standards (TS) has become increasingly important for engineering over the past years. Nowadays, undergraduates are not only our future colleagues in industry and academia, but they also constitute the future workforce of a very significant part of the industry. The accomplishments of professional activities require a correct understanding of the role of technical standards applied to the industry, especially within the computing and the telecommunications fields. One of the challenges is to find the right way to introduce technical standards to enhance the learning experience by pointing students to the best practices of the industry. We found a stimulating alternative to overcome this challenge through the IEEE Student Branch Program. This paper presents one case of success, the IEEE's Student Branch in Spain during 2014, 2015 and 2017 and the Exemplary Student Branch Award (2017 to 2019) of Region 8 (Europe, Middle East and Africa).

Research paper thumbnail of Tool Support for Model-Driven Development of Web Applications

This paper describes the engineering foundations of VisualWADE, a CASE tool to automate the produ... more This paper describes the engineering foundations of VisualWADE, a CASE tool to automate the production of Web applications. VisualWADE follows a model-driven approach focusing on requirements analysis, high level design, and rapid prototyping. In this way, an application evolves smoothly from the first prototype to the final product, and its maintenance is a natural consequence of development. The paper also discusses the lessons learned in the development of the tool and its application to several case studies in the industrial context.

Research paper thumbnail of UML for document modeling: designing document structures for massive and systematic production of XML-based web contents

Proceedings of the 8th International Conference on Model Driven Engineering Languages and Systems, 2005

Research paper thumbnail of Automating multilingual metadata vocabularies

International Conference on Dublin Core and Metadata Applications, Sep 12, 2005

Research paper thumbnail of Alopex-B: a new, simpler, but yet faster version of the alopex training algorithm

International Journal of Neural Systems, 2002

Research paper thumbnail of Metadata requirements and solutions at the Digital Library of the National Library of Spain

This article describes the requirements and technological solutions adopted by the National Libra... more This article describes the requirements and technological solutions adopted by the National Library of Spain for its Digital Library section concerning metadata. It also discusses different approaches for metadata handling in general. ♣

Research paper thumbnail of Exploring the teacher asresearcher synergy through active learning methodologies

Research paper thumbnail of Developing a simple production and dissemination model for a complex DL-news service using XML, XSLT and TEI ♣

Electronic publishing makes it possible to reach every corner of the world and opens up new resea... more Electronic publishing makes it possible to reach every corner of the world and opens up new research and communication paths. In this article we describe the production model and implementation of an electronic news service for a DL, that manages altogether five different DL-newsletters plus a monthly journal, each one of them delivered in several optional output formats, and all managed and produced based on XML-TEI and XSLT technology. Articles and news may appear in different newsletters and/or the journal, and they have different periodicity (some are published quarterly and some monthly). The news and articles come from different sources, which generally coincide with departments or units of the DL. A general editor reviews the articles or news, decides where must they appear, and also manages the distribution lists. The newsletters are not preserved, but the most relevant news and articles are published in the monthly journal called Dulcinea, which is published through Interne...

Research paper thumbnail of Automating the production of facsimiles and transcriptions for manuscripts and rare old books at the Miguel de Cervantes Digital Library

Research paper thumbnail of Migrating Language Resources from SGML to XML: the Text Encoding Initiative Recommendations

The largest effort in the area of standardisation of computer encoding of language resources has ... more The largest effort in the area of standardisation of computer encoding of language resources has been the Text Encoding Initiative (TEI), established in 1987. TEI chose as its underlying standard SGML (Standard Generalized Markup Language), and in the years before the inception of XML, a number of projects encoded their data according to some SGML DTD, TEI compliant, or otherwise. These projects could now benefit from migrating their data to XML. Apart from validation, the most compelling reason for migration is the scarcity of SGML-aware software and the abundance of XML-based tools and related recommendations. However, despite the fact that XML is a subset of SGML, migration is not a trivial process, especially in the case of large holdings of legacy language resources. This is why in 2002 the TEI Consortium established a Task Force on SGML to XML migration. The TF has now produced a number of reports that simplify and make explicit the conversion of SGML TEI (version P3) to XML T...

Research paper thumbnail of Customizing user access to the Miguel de Cervantes Digital Library

According to [1] Internet is a market with many micromarkets, based on needs, interests and trend... more According to [1] Internet is a market with many micromarkets, based on needs, interests and trends, both personal and professional. Each and every space of the net is atomized to reach the users, with their own preferences and behaviors. Our DL project intends to give the users a customized view where they could receive personalized information. We describe the goals and design ideas followed in our customization project.

Research paper thumbnail of Desarrollo de una política de preservación digital: tecnología

Research paper thumbnail of Information Retrieval in Digital Libraries : efficient catalog searches using tries

Digital Libraries of literary works usually store a huge amount of textual information. It is obv... more Digital Libraries of literary works usually store a huge amount of textual information. It is obvious that the mere accumulation of texts leads only to a limited-use library. Hence the need for eÆcient information retrieval services. The use of indices to speed up the search is advisable in cases like ours, the \Miguel de Cervantes" digital library, where the text collection is relatively large (4000 books at present) and semi-static (updated at reasonably large intervals). We developed a catalog search engine based on tries that performs fast searches with eÆcient results. However the size of the trie structures is quite big, proving adequate for catalog searches, but not for whole text indexing.

Research paper thumbnail of El Método "Software Factory": acciones para realizar prácticas más realistas, usando herramientasWeb de trabajo colaborativo, y trabajo a distancia

El metodo Software Factory (SF) consiste en llevar a cabo acciones y aplicar herramientas informa... more El metodo Software Factory (SF) consiste en llevar a cabo acciones y aplicar herramientas informaticas, con dos finalidades: (1) hacer que las practicas de asignaturas tecnologicas (en un sentido amplio) sean lo mas parecidas que sea posible al trabajo real en la empresa o la industria y (2) difundir el uso de herramientas Web que permitan nuevas formas de trabajo agiles y dinamicas, como por ejemplo el teletrabajo, el trabajo movil y el trabajo colaborativo a distancia. Este articulo describe las experiencias de los autores en este sentido.

Research paper thumbnail of Using the MatchDetectReveal System for Comparative Analysis of Texts

In this paper we are introducing the MatchDetectReveal system, which is capable of identifying th... more In this paper we are introducing the MatchDetectReveal system, which is capable of identifying the similarity between documents. Different applications of the system are discussed including cross-referencing multiple editions of literary works, plagiarism detection, organizing collections of documents and comparative analysis of texts. The system uses suffix trees and suffix vectors for comparing documents. These data structures are very fast and powerful, which allows fast comparison of documents. The front-end of the system is fully Web-based, thus users only need to use a Web browser to access the system. The results are also presented as HTML files utilising the hyperlink capabilities of HTML documents.

Research paper thumbnail of Automating the internal workflow of the Miguel de Cervantes Digital Library

We describe the digital-book-production flow of the Miguel de Cervantes Virtual Library, from boo... more We describe the digital-book-production flow of the Miguel de Cervantes Virtual Library, from book acquisition up to Internet publishing, highlighting the main requirements and design considerations of the workflow system.

Research paper thumbnail of TBL aplicado a la Ingeniería de Software

Research paper thumbnail of A reusable hypermedia design for DL manuscripts , based on XML , XSLT and Java

The purpose of this article is to describe our approach to the massive production of facsimile-ty... more The purpose of this article is to describe our approach to the massive production of facsimile-type hypertext books that contain digital images of manuscripts and old printings to be published on the Internet as one of our DL services . The goal of this project is to offer an easy-to-use interface that allows customizable views of facsimile images of book pages in several sizes and formats with transcriptions that are offered in three forms: Madison style, normal, and modernized. We explain the hypertext design, and the time-saving production methodology we use.

Research paper thumbnail of Information Extraction to feed Digital Library Databases

Most often, Digital Libraries have the need to extract information from poorly marked-up document... more Most often, Digital Libraries have the need to extract information from poorly marked-up documents to fill databases or create new hypertext documents with a highly structured markup. In this work, we approach the problem of extracting bibliographic information from literary reports in HTML format to fill a Digital Library database of Galician publications used for Internet searchs. An information extraction approach that takes advantage of both HTML markup and Natural Language Processing (NLP) techniques was successfully used for this purpose.

Research paper thumbnail of Deploying Team-Based Learning at Undergraduate Software Engineering Courses

2017 IEEE/ACM 1st International Workshop on Software Engineering Curricula for Millennials (SECM)

Education methods for millennials must accommodate their expectations and behaviors. Active learn... more Education methods for millennials must accommodate their expectations and behaviors. Active learning methodologies seem to be adequate for this requirement. In particular, in this paper, we discuss the design and deployment of Team-Based Learning (TBL) in two undergraduate Software Engineering courses. TBL is a type of Active Learning Methodology that makes extensive use of small groups to accommodate learning and empower students with the learning responsibilities in the classroom. This paper describes our concerns and the decisions we made when designing two TBL courses at ORT University. Furthermore, we evaluated the results of our deployment and compared them with published results. Our results are aligned with the expectations inferred from the literature. Students had a positive perception of the methodology and the learning outcomes.

Research paper thumbnail of Promoting the Use of Technical Standards Inindustry-Oriented Engineering Education

Педагогически форум

The role of technical standards (TS) has become increasingly important for engineering over the p... more The role of technical standards (TS) has become increasingly important for engineering over the past years. Nowadays, undergraduates are not only our future colleagues in industry and academia, but they also constitute the future workforce of a very significant part of the industry. The accomplishments of professional activities require a correct understanding of the role of technical standards applied to the industry, especially within the computing and the telecommunications fields. One of the challenges is to find the right way to introduce technical standards to enhance the learning experience by pointing students to the best practices of the industry. We found a stimulating alternative to overcome this challenge through the IEEE Student Branch Program. This paper presents one case of success, the IEEE's Student Branch in Spain during 2014, 2015 and 2017 and the Exemplary Student Branch Award (2017 to 2019) of Region 8 (Europe, Middle East and Africa).

Research paper thumbnail of Tool Support for Model-Driven Development of Web Applications

This paper describes the engineering foundations of VisualWADE, a CASE tool to automate the produ... more This paper describes the engineering foundations of VisualWADE, a CASE tool to automate the production of Web applications. VisualWADE follows a model-driven approach focusing on requirements analysis, high level design, and rapid prototyping. In this way, an application evolves smoothly from the first prototype to the final product, and its maintenance is a natural consequence of development. The paper also discusses the lessons learned in the development of the tool and its application to several case studies in the industrial context.

Research paper thumbnail of UML for document modeling: designing document structures for massive and systematic production of XML-based web contents

Proceedings of the 8th International Conference on Model Driven Engineering Languages and Systems, 2005

Research paper thumbnail of Automating multilingual metadata vocabularies

International Conference on Dublin Core and Metadata Applications, Sep 12, 2005

Research paper thumbnail of Alopex-B: a new, simpler, but yet faster version of the alopex training algorithm

International Journal of Neural Systems, 2002