Turid Hedlund | Hanken School of Economics (original) (raw)
Papers by Turid Hedlund
Background The Internet has recently made possible the free global availability of scientific jou... more Background The Internet has recently made possible the free global availability of scientific journal articles. Open Access (OA) can occur either via OA scientific journals, or via authors posting manuscripts of articles published in subscription journals in open web repositories. So far there have been few systematic studies showing how big the extent of OA is, in particular studies covering all fields of science.
Abstract Compound words form an important part of natural language. From the cross-lingual inform... more Abstract Compound words form an important part of natural language. From the cross-lingual information retrieval (CLIR) point of view it is important that many natural languages are highly productive with compounds, and translation resources cannot include entries for all compounds. Also, compounds are often content bearing words in a sentence. In Swedish, German and Finnish roughly one tenth of the words in a text prepared for information retrieval purposes are compounds.
Abstract The research problems of the thesis relate to the Scandinavian language Swedish. When th... more Abstract The research problems of the thesis relate to the Scandinavian language Swedish. When the research work on this thesis started, there was very limited knowledge on information retrieval or cross-language information retrieval research in Swedish. The linguistic features of this and other compound rich languages indicate that research focusing on languages of other types than English is of great importance.
Abstract The Internet has technically facilitated making scientific results available to a much w... more Abstract The Internet has technically facilitated making scientific results available to a much wider readership than ever before, both via electronic subscriptions but also for free in the spirit of Open Source licensing of software and the knowledge sharing of Wikipedia. This emerging openness has important implications for better impact of published research in general and for bridging the digital divide between the researchers of the leading universities and the developing nations.
Abstract We used a dictionary-based approach, and performed tests in the bilingual track with thr... more Abstract We used a dictionary-based approach, and performed tests in the bilingual track with three language pairs, ie, Swedish–English (Swe-Eng), Finnish–English (Fin-Eng), and German–English (Ger-Eng). All the source languages are compound languages, ie, languages rich in compound words. A compound word refers to a multi-word expression where the component words are written together.
Abstract Fri tillgång till forskningsresultat var en central fråga för den arbetsgrupp som grunda... more Abstract Fri tillgång till forskningsresultat var en central fråga för den arbetsgrupp som grundades 2003 och tog namnet FinnOA. I en intervju med tre medlemmar i arbetsgruppen diskuterades motiven till att starta och delta i gruppens verksamhet. Gruppen har under åren medverkat till att utarbeta riktlinjer och verksamhetsformer för främjande av fri tillgång till forskningsartiklar men också till att främja tillgången till forskningsdata.
This paper analyzes the features of the Swedish language from the viewpoint of mono-and cross-lan... more This paper analyzes the features of the Swedish language from the viewpoint of mono-and cross-language information retrieval (CLIR). The study was motivated by the fact that Swedish is known poorly from the IR perspective. This paper shows that Swedish has unique features, in particular gender features, the use of fogemorphemes in the formation of compound words, and a high frequency of homographic words. Especially in dictionary-based CLIR, correct word normalization and compound splitting are essential.
Abstract: The present publication forms the documentation of the Nordbib Workshop on Open Access ... more Abstract: The present publication forms the documentation of the Nordbib Workshop on Open Access in Elsinore 23-24 April 2007. The aim of the workshop was to engage policymakers and stakeholders in a discussion about challenges and possibilities for scientific communication and scientific publishing in the Nordic countries.
Abstract The aim of this study was to address the need of further studies on researchers' expecta... more Abstract The aim of this study was to address the need of further studies on researchers' expectancies and attitudes towards open access publishing. In particular we wanted to focus on acceptance and user behavior regarding institutional archives. The approach is domain specific and was based on a framework of theories on intellectual and social organization of the sciences and communication practices in the digital era.
Resumen: Introduction. Describes and analyses the information environment of research work in mol... more Resumen: Introduction. Describes and analyses the information environment of research work in molecular medicine. We presume an interdependence between the information environment, the research process and the related work tasks. Method. This is a qualitative case study using mixed methods. Empirical data were gathered using two surveys and six semi-structured thematic interviews.
Abstract Open access publishing strategies have traditionally been directed towards what has been... more Abstract Open access publishing strategies have traditionally been directed towards what has been regarded as a homogenous scientific community of universities, researchers and libraries. However, discipline specific practices in communication and publishing strategies are prevailing in different scientific areas. In this study we, argue that discipline specific publishing patterns may affect the ways that open access strategies can be adopted in different scientific areas.
Abstract The UTACLIR query translation system was originally designed for the CLEF 2000 and 2001 ... more Abstract The UTACLIR query translation system was originally designed for the CLEF 2000 and 2001 campaigns. In the two first years the query translation application consisted of separate programs based on common translation principles for the language pairs Finnish-English, German-English and Swedish-English. The idea of UTACLIR is based on recognizing distinct source key types and processing them accordingly.
Abstract The Internet has made possible the cost-effective dissemination of scientific journals i... more Abstract The Internet has made possible the cost-effective dissemination of scientific journals in the form of electronic versions, usually in parallel with the printed versions. At the same time the electronic medium also makes possible totally new open access (OA) distribution models, funded by author charges, sponsorship, advertising, voluntary work, etc., where the end product is free in full text to the readers.
The scientific publishing process has during the past few years undergone considerable changes, d... more The scientific publishing process has during the past few years undergone considerable changes, due to the possibilities offered by the Internet for fast delivery and inter-linking of publications which refer to each other. The socio-economic structures have, however, not changed much, and many academics and librarians view the current situation as sub-optimal and highly unsatisfactory.
Abstract In this paper, we analyze and describe the information environment of biomedicine from t... more Abstract In this paper, we analyze and describe the information environment of biomedicine from the point of view of the researchers in molecular medicine, which is a sub branch of biomedicine. We shall describe the nature of the discipline and its reflections to the information environment. A survey concerning the most important information resources in one molecular medicine research unit was conducted, and in this paper the main results of the survey is reported.
Abstract We participated in CLEF'2001 with four automated bilingual runs. UTACLIR is an automatic... more Abstract We participated in CLEF'2001 with four automated bilingual runs. UTACLIR is an automatic query translation and construction system for cross-language information retrieval. The system automatically extracts topical information from request sentences written in one of the source languages and constructs a target language query, based on translations given by a translation dictionary.
Abstract: The scientific publishing process has during the past few years undergone considerable ... more Abstract: The scientific publishing process has during the past few years undergone considerable changes. The socio-economic structures have, however, not changed much, and many academics and librarians view the current situation as highly unsatisfactory. This has triggered a number of initiatives to set up e-print repositories and electronic peer reviewed journals, which usually offer the full text for free on the Web.
Abstract In this paper we will discuss dictionary-based cross-language information retrieval (CLI... more Abstract In this paper we will discuss dictionary-based cross-language information retrieval (CLIR) methods, and report recent findings and problems. We will consider three language pairs for CLIR: Finnish to English, English to Finnish, Swedish to English. We show that Finnish and Swedish have special features, eg, the frequency of homography and a high frequency of compound words that affect retrieval effectiveness. Especially correct word form normalization and compound splitting are essential.
Abstract En landrapport om open access-projekt är av naturen aktuell endast under en relativt kor... more Abstract En landrapport om open access-projekt är av naturen aktuell endast under en relativt kort tid. Aktiviteter inom open access startar ofta som projekt och efterföljs av operationella system som driver aktiviteten vidare. Basen för den här artikeln är resultaten från ett två-årigt projekt"; OA-JES"; som avslutades i december 2007.
Background Like many other industries involved in content delivery, scientific publishing has see... more Background Like many other industries involved in content delivery, scientific publishing has seen new challenges and opportunities with the wide adoption of the Internet. In the early days of the Web, before the 1990s, electronic mailing lists were a popular method for distributing longer strings of text, like journal articles, to groups of people. Since then, technology and web standards have rapidly progressed and matured.
Background The Internet has recently made possible the free global availability of scientific jou... more Background The Internet has recently made possible the free global availability of scientific journal articles. Open Access (OA) can occur either via OA scientific journals, or via authors posting manuscripts of articles published in subscription journals in open web repositories. So far there have been few systematic studies showing how big the extent of OA is, in particular studies covering all fields of science.
Abstract Compound words form an important part of natural language. From the cross-lingual inform... more Abstract Compound words form an important part of natural language. From the cross-lingual information retrieval (CLIR) point of view it is important that many natural languages are highly productive with compounds, and translation resources cannot include entries for all compounds. Also, compounds are often content bearing words in a sentence. In Swedish, German and Finnish roughly one tenth of the words in a text prepared for information retrieval purposes are compounds.
Abstract The research problems of the thesis relate to the Scandinavian language Swedish. When th... more Abstract The research problems of the thesis relate to the Scandinavian language Swedish. When the research work on this thesis started, there was very limited knowledge on information retrieval or cross-language information retrieval research in Swedish. The linguistic features of this and other compound rich languages indicate that research focusing on languages of other types than English is of great importance.
Abstract The Internet has technically facilitated making scientific results available to a much w... more Abstract The Internet has technically facilitated making scientific results available to a much wider readership than ever before, both via electronic subscriptions but also for free in the spirit of Open Source licensing of software and the knowledge sharing of Wikipedia. This emerging openness has important implications for better impact of published research in general and for bridging the digital divide between the researchers of the leading universities and the developing nations.
Abstract We used a dictionary-based approach, and performed tests in the bilingual track with thr... more Abstract We used a dictionary-based approach, and performed tests in the bilingual track with three language pairs, ie, Swedish–English (Swe-Eng), Finnish–English (Fin-Eng), and German–English (Ger-Eng). All the source languages are compound languages, ie, languages rich in compound words. A compound word refers to a multi-word expression where the component words are written together.
Abstract Fri tillgång till forskningsresultat var en central fråga för den arbetsgrupp som grunda... more Abstract Fri tillgång till forskningsresultat var en central fråga för den arbetsgrupp som grundades 2003 och tog namnet FinnOA. I en intervju med tre medlemmar i arbetsgruppen diskuterades motiven till att starta och delta i gruppens verksamhet. Gruppen har under åren medverkat till att utarbeta riktlinjer och verksamhetsformer för främjande av fri tillgång till forskningsartiklar men också till att främja tillgången till forskningsdata.
This paper analyzes the features of the Swedish language from the viewpoint of mono-and cross-lan... more This paper analyzes the features of the Swedish language from the viewpoint of mono-and cross-language information retrieval (CLIR). The study was motivated by the fact that Swedish is known poorly from the IR perspective. This paper shows that Swedish has unique features, in particular gender features, the use of fogemorphemes in the formation of compound words, and a high frequency of homographic words. Especially in dictionary-based CLIR, correct word normalization and compound splitting are essential.
Abstract: The present publication forms the documentation of the Nordbib Workshop on Open Access ... more Abstract: The present publication forms the documentation of the Nordbib Workshop on Open Access in Elsinore 23-24 April 2007. The aim of the workshop was to engage policymakers and stakeholders in a discussion about challenges and possibilities for scientific communication and scientific publishing in the Nordic countries.
Abstract The aim of this study was to address the need of further studies on researchers' expecta... more Abstract The aim of this study was to address the need of further studies on researchers' expectancies and attitudes towards open access publishing. In particular we wanted to focus on acceptance and user behavior regarding institutional archives. The approach is domain specific and was based on a framework of theories on intellectual and social organization of the sciences and communication practices in the digital era.
Resumen: Introduction. Describes and analyses the information environment of research work in mol... more Resumen: Introduction. Describes and analyses the information environment of research work in molecular medicine. We presume an interdependence between the information environment, the research process and the related work tasks. Method. This is a qualitative case study using mixed methods. Empirical data were gathered using two surveys and six semi-structured thematic interviews.
Abstract Open access publishing strategies have traditionally been directed towards what has been... more Abstract Open access publishing strategies have traditionally been directed towards what has been regarded as a homogenous scientific community of universities, researchers and libraries. However, discipline specific practices in communication and publishing strategies are prevailing in different scientific areas. In this study we, argue that discipline specific publishing patterns may affect the ways that open access strategies can be adopted in different scientific areas.
Abstract The UTACLIR query translation system was originally designed for the CLEF 2000 and 2001 ... more Abstract The UTACLIR query translation system was originally designed for the CLEF 2000 and 2001 campaigns. In the two first years the query translation application consisted of separate programs based on common translation principles for the language pairs Finnish-English, German-English and Swedish-English. The idea of UTACLIR is based on recognizing distinct source key types and processing them accordingly.
Abstract The Internet has made possible the cost-effective dissemination of scientific journals i... more Abstract The Internet has made possible the cost-effective dissemination of scientific journals in the form of electronic versions, usually in parallel with the printed versions. At the same time the electronic medium also makes possible totally new open access (OA) distribution models, funded by author charges, sponsorship, advertising, voluntary work, etc., where the end product is free in full text to the readers.
The scientific publishing process has during the past few years undergone considerable changes, d... more The scientific publishing process has during the past few years undergone considerable changes, due to the possibilities offered by the Internet for fast delivery and inter-linking of publications which refer to each other. The socio-economic structures have, however, not changed much, and many academics and librarians view the current situation as sub-optimal and highly unsatisfactory.
Abstract In this paper, we analyze and describe the information environment of biomedicine from t... more Abstract In this paper, we analyze and describe the information environment of biomedicine from the point of view of the researchers in molecular medicine, which is a sub branch of biomedicine. We shall describe the nature of the discipline and its reflections to the information environment. A survey concerning the most important information resources in one molecular medicine research unit was conducted, and in this paper the main results of the survey is reported.
Abstract We participated in CLEF'2001 with four automated bilingual runs. UTACLIR is an automatic... more Abstract We participated in CLEF'2001 with four automated bilingual runs. UTACLIR is an automatic query translation and construction system for cross-language information retrieval. The system automatically extracts topical information from request sentences written in one of the source languages and constructs a target language query, based on translations given by a translation dictionary.
Abstract: The scientific publishing process has during the past few years undergone considerable ... more Abstract: The scientific publishing process has during the past few years undergone considerable changes. The socio-economic structures have, however, not changed much, and many academics and librarians view the current situation as highly unsatisfactory. This has triggered a number of initiatives to set up e-print repositories and electronic peer reviewed journals, which usually offer the full text for free on the Web.
Abstract In this paper we will discuss dictionary-based cross-language information retrieval (CLI... more Abstract In this paper we will discuss dictionary-based cross-language information retrieval (CLIR) methods, and report recent findings and problems. We will consider three language pairs for CLIR: Finnish to English, English to Finnish, Swedish to English. We show that Finnish and Swedish have special features, eg, the frequency of homography and a high frequency of compound words that affect retrieval effectiveness. Especially correct word form normalization and compound splitting are essential.
Abstract En landrapport om open access-projekt är av naturen aktuell endast under en relativt kor... more Abstract En landrapport om open access-projekt är av naturen aktuell endast under en relativt kort tid. Aktiviteter inom open access startar ofta som projekt och efterföljs av operationella system som driver aktiviteten vidare. Basen för den här artikeln är resultaten från ett två-årigt projekt"; OA-JES"; som avslutades i december 2007.
Background Like many other industries involved in content delivery, scientific publishing has see... more Background Like many other industries involved in content delivery, scientific publishing has seen new challenges and opportunities with the wide adoption of the Internet. In the early days of the Web, before the 1990s, electronic mailing lists were a popular method for distributing longer strings of text, like journal articles, to groups of people. Since then, technology and web standards have rapidly progressed and matured.