Large Pre-Trained Models with Extra-Large Vocabularies: A Contrastive Analysis of Hebrew BERT Models and a New One to Outperform Them All (original) (raw)

AlephBERT: A Hebrew Large Pre-Trained Language Model to Start-off your Hebrew NLP Application With

Dan Bareket

2021

View PDFchevron_right

Choosing an optimal architecture for segmentation and POS-tagging of Modern Hebrew

Khalil Sima'an

2005

View PDFchevron_right

Noun phrase chunking in hebrew: Influence of lexical and morphological features

Michael Elhadad

2006

View PDFchevron_right

Representations and Architectures in Neural Sentiment Analysis for Morphologically Rich Languages: A Case Study from Modern Hebrew

Anat Ben-David

View PDFchevron_right

Joint Hebrew segmentation and parsing using a PCFG-LA lattice parser

Michael Elhadad

2011

View PDFchevron_right

Investigating the effect of sub-word segmentation on the performance of transformer language models

Anisya Katinskaya

arXiv (Cornell University), 2023

View PDFchevron_right

Basic Word Completion and Prediction for Hebrew

Izek Greenfield

Lecture Notes in Computer Science, 2012

View PDFchevron_right

HeQ: a Large and Diverse Hebrew Reading Comprehension Benchmark

Hilla Merhav

2022

View PDFchevron_right

An Unsupervised Morpheme-Based HMM for Hebrew

Michael Elhadad

View PDFchevron_right

Accurate Unlexicalized Parsing for Modern Hebrew

Khalil Sima'an

Lecture Notes in Computer Science, 2007

View PDFchevron_right

Hebrew computational linguistics: Past and future

Shuly Wintner

Artificial Intelligence Review, 2004

View PDFchevron_right

Hebrew Named Entity Recognition

Michael Elhadad

MONEY

View PDFchevron_right

Building a tree-bank of modern Hebrew text

Alon Itai

… Automatique des Langues, 2001

View PDFchevron_right

Experiments with Language Models for Word Completion and Prediction in Hebrew

Yaakov Hacohen-Kerner

Lecture Notes in Computer Science, 2014

View PDFchevron_right

A Novel Challenge Set for Hebrew Morphological Disambiguation and Diacritics Restoration

Avi Shmidman

Findings of the Association for Computational Linguistics: EMNLP 2020

View PDFchevron_right

Overview of the progression of state-of-the-art language models

TELKOMNIKA JOURNAL

TELKOMNIKA Telecommunication Computing Electronics and Control, 2024

View PDFchevron_right

A morphologically annotated Hebrew CHILDES corpus

Brian Macwhinney

View PDFchevron_right

A computational lexicon of contemporary Hebrew

Shuly Wintner

… of The fifth international conference on …, 2006

View PDFchevron_right

On Losses for Modern Language Models

Stéphane Aroca-Ouellette, Frank Rudzicz

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

View PDFchevron_right

An unsupervised morpheme-based HMM for hebrew morphological disambiguation

Michael Elhadad

Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL - ACL '06, 2006

View PDFchevron_right

A Large and Diverse Arabic Corpus for Language Modeling

Abbas Ali

arXiv (Cornell University), 2022

View PDFchevron_right

Smoothing a lexicon-based POS tagger for Arabic and Hebrew

Khalil Sima'an

Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages Common Issues and Resources - Semitic '07, 2007

View PDFchevron_right

Linguistic Variations in Classical Hebrew: from Markov Models to Neural Networks

Yanniek van der Schans, David Ruhe

View PDFchevron_right

Toward Better Understanding of Hebrew NP Chunks

Michael Elhadad

2007

View PDFchevron_right

Sigmorphon 2019 Task 2 system description paper: Morphological analysis in context for many languages, with supervision from only a few

Alexis Palmer

Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology

View PDFchevron_right

Word Segmentation, Unknown-word Resolution, and Morphological Agreement in a Hebrew Parsing System

Michael Elhadad

Computational Linguistics, 2013

View PDFchevron_right

Automatic Thesaurus Construction for Modern Hebrew

Jonathan Schler

2018

View PDFchevron_right

Benchmarking Arabic AI with Large Language Models

Firoj Alam

arXiv (Cornell University), 2023

View PDFchevron_right

Designing CoSIH: The Corpus of Spoken Israeli Hebrew

Giora Rahav

International Journal of Corpus Linguistics, 2001

View PDFchevron_right

SVM model tampering and anchored learning: a case study in Hebrew NP chunking

Michael Elhadad

2007

View PDFchevron_right

On the Importance of Tokenization in Arabic Embedding Models

Mairaj Syed

Proceedings of the Fifth Arabic Natural Language Processing Workshop, 2020

View PDFchevron_right

A Transformer-based Parser for Syriac Morphology

Martijn Naaijer

Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

View PDFchevron_right

A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek

Els Lefever

Proceedings of the 5th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, 2021

View PDFchevron_right