Publications
All publications of the Lab.
2022
-
German Medical Natural Language Processing–A Data-centric Survey. Applications in Medicine and Manufacturing, Nov 2022
-
CNN-based Ruled Line Removal in Handwritten Documents. In Proceedings of the 18th International Conference on Frontiers in Handwriting Recognition (ICFHR 2022), Dec 2022
-
Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2022). In Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2022), Jul 2022
2021
-
Implicit Phenomena in Short-answer Scoring Data. In Proceedings of the First Workshop on Understanding Implicit and Underspecified Language, Aug 2021
-
-
Personalizing Handwriting Recognition Systems with Limited User-Specific Samples. In Proceedings of the 16th International Conference on Document Analysis and Recognition (ICDAR 2021), Aug 2021
-
Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications. In Proceedings of the 16th International Conference on Document Analysis and Recognition (ICDAR 2021), Apr 2021
-
C-Test Collector: A Proficiency Testing Application to Collect Training Data for C-Tests. In Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications, Apr 2021
-
Fully vs. Weakly Supervised Caries Localization in Smartphone Images with CNNs. In Artificial Intelligence for Healthcare Applications International Workshop - ICPR 2020 Workshop Proceedings, Jan 2021
2020
-
Don’t take "nswvtnvakgxpm" for an answer - The surprising vulnerability of automatic content scoring systems to adversarial input. In Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), Jan 2020
-
Chinese Content Scoring: Open-Access Datasets and Features on Different Segmentation Levels. In Proceedings of the 1st conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2020), Jan 2020
-
Digital Transformation: A unique Chance to Shape the Future. In Proceedings of the 1st conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2020), Jan 2020
-
Automated Scoring of Teachers’ Pedagogical Content Knowledge - A Comparison between Human and Machine Scoring. Frontiers in Education, Jan 2020
-
Appropriateness and Pedagogic Usefulness of Reading Comprehension Questions. In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC-2020), Jan 2020
-
Exploring the Impact of Handwriting Recognition on the Automated Scoring of Handwritten Student Answers. In Proceedings of the 17th International Conference on Frontiers in Handwriting Recognition (ICFHR 2020), Jan 2020
-
Decomposing and Comparing Meaning Relations: Paraphrasing, Textual Entailment, Contradiction, and Specificity. In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC-2020), Jan 2020
2019
-
A survey of semantic relatedness evaluation datasets and procedures. Artificial Intelligence Review, Jan 2019
-
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications. In Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, Sep 2019
-
German End-to-end Speech Recognition based on DeepSpeech. In Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019): Long Papers, Sep 2019
-
Annotating and analyzing the interactions between meaning relations. In Proceedings of the 13th Linguistic Annotation Workshop, Sep 2019
-
Automatic Diacritization as Prerequisite Towards the Automatic Generation of Arabic Lexical Recognition Tests. In Proceedings of the 3rd International Conference on Natural Language and Speech Processing, Sep 2019
-
RELATIONS-Workshop on meaning relations between phrases and sentences. In RELATIONS-Workshop on meaning relations between phrases and sentences, Sep 2019
-
The Influence of Variance in Learner Answers on Automatic Content Scoring. Frontiers in Education, Sep 2019
-
From legal to technical concept: Towards an automated classification of German political Twitter postings as criminal offenses. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), Sep 2019
-
ltl.uni-due at SemEval 2019 Task 5: Simple but Effective Lexico-Semantic Features for Detecting Hate Speech in Twitter. In Proceedings of the International Workshop on Semantic Evaluation (SemEval), Sep 2019
-
LTL-UDE at SemEval-2019 Task 6: BERT and Two-Vote Classification for Categorizing Offensiveness. In Proceedings of the International Workshop on Semantic Evaluation (SemEval), Sep 2019
-
Computer-assisted Understanding of Stance in Social Media: Formalizations, Data Creation, and Prediction Models. In Proceedings of the International Workshop on Semantic Evaluation (SemEval), Sep 2019
2018
-
Corpus of Aspect-based Sentiment in Political Debates. In KONVENS, Sep 2018
-
Do Women Perceive Hate Differently: Examining the Relationship Between Hate Speech, Gender, and Agreement Judgments. In Proceedings of the 14th Conference on Natural Language Processing (KONVENS 2018), Sep 2018
-
The Role of Diacritics in Increasing the Difficulty of Arabic Lexical Recognition Tests. In Proceedings of the 7th Workshop on NLP for Computer Assisted Language Learning at SLTC 2018 (NLP4CALL 2018), Sep 2018
-
A flexible online system for curating reduced redundancy language exercises and tests. In Future-proof CALL: language learning as exploration and encounters – short papers from EUROCALL 2018, Sep 2018
-
ESCRITO-An NLP-Enhanced Educational Scoring Toolkit. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018), Sep 2018
-
Quantifying qualitative data for understanding controversial issues. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018), Sep 2018
-
DeepTC–An Extension of DKPro Text Classification for Fostering Reproducibility of Deep Learning Experiments. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018), Sep 2018
-
Robust Part-of-Speech Tagging of Social Media Text. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018), Sep 2018
-
-
Exploring the Effects of Diacritization on Arabic Frequency Counts. In International Conference on Natural Language and Speech Processing (ICNLSP 2018), Sep 2018
-
Cross-lingual Content Scoring. In Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications, Sep 2018
2017
-
A Survey and Comparative Study of Arabic Diacritization Tools. JLCL: Special Issue-NLP for Perso-Arabic Alphabets, Sep 2017
-
The Influence of Spelling Error on Content Scoring Performance. In Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications, Sep 2017
-
GermEval 2017: Shared Task on Aspect-based Sentiment in Social Media Customer Feedback. In Proceedings of the GermEval 2017 Shared Task on Aspect-based Sentiment in Social Media Customer Feedback, Sep 2017
-
Same same, but different: Compositionality of paraphrase granularity levels. In Proceedings of the Recent Advances in Natural Language Processing (RANLP-2017), Sep 2017
-
The Role of Diacritics in Designing Lexical Recognition Tests for Arabic. In 3rd International Conference on Arabic Computational Linguistics (ACLing 2017), Sep 2017
-
Part-of-speech tagging for corpora of computer-mediated communication: A case study on finding rare phenomena. In 3rd International Conference on Arabic Computational Linguistics (ACLing 2017), Sep 2017
-
Investigating neural architectures for short answer scoring. In Proceedings of the Building Educational Applications Workshop at EMNLP, Sep 2017
-
Neural, Non-neural and Hybrid Stance Detection in Tweets on Catalan Independence. In Stance and Gender Detection in Tweets on Catalan Independence at Ibereval 2017, Sep 2017
-
Fine-grained essay scoring of a complex writing task for native speakers. In Proceedings of the Building Educational Applications Workshop at EMNLP, Sep 2017
-
Reliable Part-of-Speech Tagging of Low-frequency Phenomena in the Social Media Domain. In Proceedings of the Conference on CMC and Social Media Corpora for the Humanities, Sep 2017
-
Do LSTMs really work so well for PoS tagging? – A replication study. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Sep 2017
-
What does this imply? Examining the Impact of Implicitness on the Perception of Hate Speech. In Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology (GSCL-2017), Sep 2017
2016
-
LTL-UDE at EmpiriST 2015: Tokenization and PoS Tagging of Social Media Text. In Proceedings of the 10th Web as Corpus Workshop (WAC-X) and the EmpiriST Shared Task, Sep 2016
-
Stance-based Argument Mining – Modeling Implicit Argumentation Using Stance. In Proceedings of the KONVENS, Sep 2016
-
ltl.uni-due at SemEval-2016 Task 6: Stance Detection in Social Media Using Stacked Classifiers. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval 2016), Sep 2016
-
Assigning Fine-grained PoS Tags based on High-precision Coarse-grained Tagging. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Sep 2016
-
Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Sep 2016
-
Predicting the Spelling Difficulty of Words for Language Learners. In Proceedings of the Building Educational Applications Workshop at NAACL, Sep 2016
-
Bundled Gap Filling: A New Paradigm for Unambiguous Cloze Exercises. In Proceedings of the Building Educational Applications Workshop at NAACL, Sep 2016
-
FlexTag: A Highly Flexible Pos Tagging Framework. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), Sep 2016
-
Building a Social Media Adapted PoS Tagger Using FlexTag – A Case Study on Italian Tweets. In Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian - EVALITA 2016, Sep 2016
-
Validating Bundled Gap Filling – Empirical Evidence for Ambiguity Reduction and Language Proficiency Testing Capabilities. In Proceedings of the NLP4CALL at SLTC 2016, Sep 2016
-
Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis. In Proceedings of NLP4CMC III: 3rd Workshop on Natural Language Processing for Computer-Mediated Communication, Sep 2016
2015
-
Candidate evaluation strategies for improved difficulty prediction of language tests. In Proceedings of the Building Educational Applications Workshop at NAACL, Sep 2015
-
Fast or Accurate? – A Comparative Evaluation of PoS Tagging Models. In Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology (GSCL-2015), Sep 2015
-
Reducing Annotation Efforts in Supervised Short Answer Scoring. In Proceedings of the Building Educational Applications Workshop at NAACL, Sep 2015
-
Task-Independent Features for Automated Essay Grading. In Proceedings of the Building Educational Applications Workshop at NAACL, Sep 2015
-
Counting What Counts: Decompounding for Keyphrase Extraction. In Counting What Counts: Decompounding for Keyphrase Extraction, Sep 2015
-
Effectiveness of Domain Adaptation Approaches for Social Media PoS Tagging. In Proceedings of the Second Italian Conference on Computational Linguistics, Sep 2015
-
Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). In Proceedings of the Second Italian Conference on Computational Linguistics, Sep 2015
-
Composing Measures for Computing Text Similarity. In Proceedings of the Second Italian Conference on Computational Linguistics, Sep 2015
-
Generating Nonwords for Vocabulary Proficiency Testing. In Proceedings of the 7th Language and Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Sep 2015
2014
-
DKPro Keyphrases: Flexible and Reusable Keyphrase Extraction Experiments. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. System Demonstrations, Sep 2014
-
Readability for foreign language learning: The importance of cognates. International Journal of Applied Linguistics, Sep 2014
-
Automatic Generation of Challenging Distractors Using Context-Sensitive Inference Rules. In Proceedings of the 9th Workshop on Innovative Use of NLP for Building Educational Applications at ACL, Sep 2014
-
DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. System Demonstrations, Sep 2014
-
Sense and Similarity: A Study of Sense-level Similarity Measures. In Proceedings of the 3rd Joint Conference on Lexical and Computational Semantics (*SEM 2014), Sep 2014
-
Towards Automatic Scoring of Cloze Items by Selecting Low-Ambiguity Contexts. In 3rd workshop on NLP for computer-assisted language learning, Sep 2014
2013
-
Recognizing Partial Textual Entailment. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Sep 2013
-
UKP-BIU: Similarity and Entailment Metrics for Student Response Analysis. In Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013), Sep 2013
-
SemEval-2013 Task 5: Evaluating Phrasal Semantics. Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2013), Sep 2013
-
DKPro Similarity: An Open Source Framework for Text Similarity. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (System Demonstrations), Sep 2013
-
Detecting Malapropisms Using Measures of Contextual Fitness. Special Issue of the TAL Journal on "Managing Noise in the Signal: Error Handling in Natural Language Processing", Sep 2013
-
-
Language Resources and Evaluation Journal - Special Issue on Collaboratively Constructed Language Resources. In Proceedings of 9th Conference on Recent Advances in Natural Language Processing (RANLP 2013), Sep 2013
-
Cognate Production using Character-based Machine Translation. In Proceedings of the 6th International Joint Conference on Natural Language Processing, Sep 2013
2012
-
Measuring Contextual Fitness Using Error Contexts Extracted from the Wikipedia Revision History. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), Sep 2012
-
HOO 2012 Shared Task: UKP Lab System Description. In Proceedings of the Seventh Workshop on Innovative Use of NLP for Building Educational Applications at NAACL-HLT, Sep 2012
-
-
UKP: Computing Semantic Textual Similarity by Combining Multiple Content Similarity Measures. In Proceedings of the 6th International Workshop on Semantic Evaluation, held in conjunction with the 1st Joint Conference on Lexical and Computational Semantics, Sep 2012
-
Towards fine-grained readability measures for self-directed language learning. In Proceedings of the SLTC 2012 workshop on NLP for CALL, Sep 2012
-
Text Reuse Detection Using a Composition of Text Similarity Measures. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Sep 2012
-
Collective Intelligence and Language Resources: Introduction to the Special Issue on Collaboratively Constructed Language Resources. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Sep 2012
-
Using Distributional Similarity for Lexical Expansion in Knowledge-based Word Sense Disambiguation. In Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), Sep 2012
2011
-
Wikipedia Revision Toolkit: Efficiently Accessing Wikipedia’s Edit History. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. System Demonstrations, Sep 2011
-
Wikulu: An Extensible Architecture for Integrating Natural Language Processing Techniques with Wikis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. System Demonstrations, Sep 2011
-
A Reflective View on Text Similarity. In Proceedings of the International Conference on Recent Advances in Natural Language Processing, Sep 2011
-
First Aid for Information Chaos in Wikis: Collaborative Information Management Enhanced Through Language Technology. In Proceedings of the International Conference on Recent Advances in Natural Language Processing, Sep 2011
-
Aufbereitung und Strukturierung von Information mittels automatischer Sprachverarbeitung. In Proceedings of KnowTech, Sep 2011
-
Helping Our Own 2011: UKP Lab System Description. In Proceedings of the Helping Our Own Working Group Session at the 13th European Workshop on Natural Language Generation, Sep 2011
2010
-
The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures. In Proceedings of the Seventh International Conference on Language Resources and Evaluation, Sep 2010
-
2nd Workshop on The People’s Web Meets NLP: Collaboratively Constructed Semantic Resources, Sep 2010
-
Wisdom of Crowds versus Wisdom of Linguists - Measuring the Semantic Relatedness of Words. Journal of Natural Language Engineering, Sep 2010
-
Effektivere Informationssuche im World Wide Web. Journal of Natural Language Engineering, Sep 2010
2009
-
Proceedings of the Workshop on The People’s Web Meets NLP: Collaboratively Constructed Semantic Resources. Journal of Natural Language Engineering, Sep 2009
-
Semantic relations in a bilingual corpus of different registers. In Deutsche Gesellschaft für Sprachwissenschaft (DGfS) Workshop on Corpus, Colligation, Register Variation, Sep 2009
-
An Architecture to Support Intelligent User Interfaces for Wikis by Means of Natural Language Processing. In Proceedings of the International Symposium on Wikis and Open Collaboration (WikiSym ’09), Sep 2009
-
Approximate Matching for Evaluating Keyphrase Extraction. In Proceedings of the 7th International Conference on Recent Advances in Natural Language Processing, Sep 2009
-
Study of Semantic Relatedness of Words Using Collaboratively Constructed Semantic Resources. In Proceedings of the 7th International Conference on Recent Advances in Natural Language Processing, Sep 2009
2008
-
Flexible UIMA Components for Information Retrieval Research. In Proceedings of the LREC 2008 Workshop ’Towards Enhanced Interoperability for Large HLT Systems: UIMA for NLP’, Sep 2008
-
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary. In Proceedings of the 6th International Conference on Language Resources and Evaluation, Sep 2008
-
-
Representational Interoperability of Linguistic and Collaborative Knowledge Bases. In Proceedings of the KONVENS Workshop on Lexical-Semantic and Ontological Resources – Maintenance, Representation, and Standards, Sep 2008
-
Graph-Theoretic Analysis of Collaborative Knowledge Bases in Natural Language Processing. In Proceedings of the Poster Session of the 7th International Semantic Web Conference, Sep 2008
-
Using Wiktionary for Computing Semantic Relatedness. In Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, Sep 2008
-
Using Similarity Measures for Context-Aware User Interfaces. In Proceedings of the 2nd IEEE International Conference on Semantic Computing, Sep 2008
2007
-
Analysis of the Wikipedia Category Graph for NLP Applications. In Proceedings of the TextGraphs-2 Workshop (NAACL-HLT 2007), Sep 2007
-
Cross-lingual Distributional Profiles of Concepts for Measuring Semantic Distance. In Proceedings of EMNLP-CoNLL, Sep 2007
-
Darmstadt Knowledge Processing Repository Based on UIMA. In Proceedings of the First Workshop on Unstructured Information Management Architecture at Biannual Conference of the Society for Computational Linguistics and Language Technology, Sep 2007
-
Teaching “Unstructured Information Management: Theory and Applications” to Computational Linguistics Students. In Proceedings of the First Workshop on Unstructured Information Management Architecture at Biannual Conference of the Society for Computational Linguistics and Language Technology, Sep 2007
-
Analyzing and Accessing Wikipedia as a Lexical Semantic Resource. In Proceedings of the First Workshop on Unstructured Information Management Architecture at Biannual Conference of the Society for Computational Linguistics and Language Technology, Sep 2007
-
Comparing Wikipedia and German Wordnet by Evaluating Semantic Relatedness on Multiple Datasets. In Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), Sep 2007
-
What to be? - Electronic Career Guidance Based on Semantic Relatedness. In Proceedings of ACL, Sep 2007
2006
-
Automatically Creating Datasets for Measures of Semantic Relatedness. In Proceedings of the COLING/ACL Workshop on Linguistic Distances, Sep 2006