TEXAS 2014 - International Workshop on Taxonomy Extraction with Applications in Semantics
Topics/Call fo Papers
Taxonomies form the backbone of knowledge-based systems by organizing knowledge in a machine interpretable manner and facilitating information integration. Hierarchical structures provide valuable input in knowledge-intensive applications such as question answering and textual entailment and are useful tools for browsing and navigation of document collections, especially when applied for exploration and discovery.
Although some taxonomies are readily available as part of language and web resources such as WordNet and Wikipedia, not all domains are covered and existing taxonomies are often too small to fully describe a domain. Automatic taxonomy extraction methods have been developed in recent years to address this problem, but issues remain in evaluation, comparison and application of extracted taxonomies [1, 2, 3, 4, 5, 6]. Depending on the application, multiple perspectives can be equally valid both in the selection of concepts and in the extraction of relations between them. This makes the resulting taxonomies difficult to compare, as they are based on different requirements. For instance, WordNet is a lexical semantic resource that is used mainly for tracking hyponymic substitution (e.g. ‘table’ can be replaced by ‘furniture’) with the main requirement of broad lexical coverage. On the other hand, subject hierarchies, such as the ACM Subject Classification, are used mainly for document collection browsing (e.g. fine-grained topic distinction such as ‘information retrieval’ vs. ‘information extraction’) with the main requirements of comprehensibility and coherence.
The TEXAS workshop aims at addressing these issues by providing a venue for presenting and discussing approaches that evaluate taxonomy extraction [7], and its subtasks (term/concept extraction, term/concept relation discovery, taxonomy construction and cleaning) in the context of semantic applications such as: entity search, entity disambiguation and linking, information integration and summarization, knowledge acquisition, knowledge sharing, inference in NLP tasks (question answering, textual entailment), etc. In this way, progress towards automatically constructed hierarchies can be measured relative to other tasks and real-world applications.
[1] Kozareva, Zornitsa, and Eduard Hovy. "A semi-supervised method to learn and construct taxonomies using the web." Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2010.
[2] Navigli, Roberto, Paola Velardi, and Stefano Faralli. "A graph-based algorithm for inducing lexical taxonomies from scratch." Proceedings of the Twenty-Second international joint conference on Artificial Intelligence- Volume Three. AAAI Press, 2011.
[3] Medelyan, Olena, et al. "Constructing a Focused Taxonomy from a Document Collection." The Semantic Web: Semantics and Big Data. Springer Berlin Heidelberg, 2013. 367-381.
[4] Stoica, Emilia, Marti A. Hearst, and Megan Richardson. "Automating Creation of Hierarchical Faceted Metadata Structures." HLT-NAACL. 2007. [5] Wang, Wei, Payam M. Barnaghi, and Andrzej Bargiela. "Probabilistic topic models for learning terminological ontologies." Knowledge and Data Engineering, IEEE Transactions on 22.7 (2010): 1028-1040.
[6] Paola Velardi, Stefano Faralli, Roberto Navigli: OntoLearn Reloaded: A Graph-Based Algorithm for Taxonomy Induction. Computational Linguistics 39(3): 665-707 (2013)
[7] Elias Zavitsanos, Georgios Paliouras, George A. Vouros: Gold Standard Evaluation of Ontology Learning Methods through Ontology Transformation and Alignment. IEEE Trans. Knowl. Data Eng. 23(11): 1635-1648 (2011)
Topics
Expected research topics of relevance to the workshop:
application-based evaluation of taxonomies in question answering, document browsing,document clustering, expert finding or other applications;
using automatically constructed taxonomies for searching, browsing and organizing information
constructing taxonomies for/from social media
probabilistic models for topic hierarchies (hierarchical topic modeling)
constructing taxonomies using hierarchical clustering
using distributional models for taxonomy construction
acquisition and modeling of categorical structure and modeling human category acquisition
constructing topic categorization systems and subject hierarchies
constructing hierarchical faceted metadata structures
methods for transforming semi-structured knowledge resources into taxonomies
merging and aligning existing resources for taxonomy construction
comparing, aligning and evaluating existing hierarchical structures
domain glossary acquisition and extracting taxonomies from definitions
constructing application/domain specific taxonomies from existing resources (lexical resources,Linked Open Data, Wikipedia category structure, semantic networks)
using different hierarchical structures (e.g., tree, DAG) and relation types (e.g., hyponymy, meronymy) for taxonomy construction
attaching Named Entities to hierarchical structures and using Named Entities to drive taxonomy construction by extensional analysis
multilinguality and taxonomies: constructing and using multilingual taxonomies
Although some taxonomies are readily available as part of language and web resources such as WordNet and Wikipedia, not all domains are covered and existing taxonomies are often too small to fully describe a domain. Automatic taxonomy extraction methods have been developed in recent years to address this problem, but issues remain in evaluation, comparison and application of extracted taxonomies [1, 2, 3, 4, 5, 6]. Depending on the application, multiple perspectives can be equally valid both in the selection of concepts and in the extraction of relations between them. This makes the resulting taxonomies difficult to compare, as they are based on different requirements. For instance, WordNet is a lexical semantic resource that is used mainly for tracking hyponymic substitution (e.g. ‘table’ can be replaced by ‘furniture’) with the main requirement of broad lexical coverage. On the other hand, subject hierarchies, such as the ACM Subject Classification, are used mainly for document collection browsing (e.g. fine-grained topic distinction such as ‘information retrieval’ vs. ‘information extraction’) with the main requirements of comprehensibility and coherence.
The TEXAS workshop aims at addressing these issues by providing a venue for presenting and discussing approaches that evaluate taxonomy extraction [7], and its subtasks (term/concept extraction, term/concept relation discovery, taxonomy construction and cleaning) in the context of semantic applications such as: entity search, entity disambiguation and linking, information integration and summarization, knowledge acquisition, knowledge sharing, inference in NLP tasks (question answering, textual entailment), etc. In this way, progress towards automatically constructed hierarchies can be measured relative to other tasks and real-world applications.
[1] Kozareva, Zornitsa, and Eduard Hovy. "A semi-supervised method to learn and construct taxonomies using the web." Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2010.
[2] Navigli, Roberto, Paola Velardi, and Stefano Faralli. "A graph-based algorithm for inducing lexical taxonomies from scratch." Proceedings of the Twenty-Second international joint conference on Artificial Intelligence- Volume Three. AAAI Press, 2011.
[3] Medelyan, Olena, et al. "Constructing a Focused Taxonomy from a Document Collection." The Semantic Web: Semantics and Big Data. Springer Berlin Heidelberg, 2013. 367-381.
[4] Stoica, Emilia, Marti A. Hearst, and Megan Richardson. "Automating Creation of Hierarchical Faceted Metadata Structures." HLT-NAACL. 2007. [5] Wang, Wei, Payam M. Barnaghi, and Andrzej Bargiela. "Probabilistic topic models for learning terminological ontologies." Knowledge and Data Engineering, IEEE Transactions on 22.7 (2010): 1028-1040.
[6] Paola Velardi, Stefano Faralli, Roberto Navigli: OntoLearn Reloaded: A Graph-Based Algorithm for Taxonomy Induction. Computational Linguistics 39(3): 665-707 (2013)
[7] Elias Zavitsanos, Georgios Paliouras, George A. Vouros: Gold Standard Evaluation of Ontology Learning Methods through Ontology Transformation and Alignment. IEEE Trans. Knowl. Data Eng. 23(11): 1635-1648 (2011)
Topics
Expected research topics of relevance to the workshop:
application-based evaluation of taxonomies in question answering, document browsing,document clustering, expert finding or other applications;
using automatically constructed taxonomies for searching, browsing and organizing information
constructing taxonomies for/from social media
probabilistic models for topic hierarchies (hierarchical topic modeling)
constructing taxonomies using hierarchical clustering
using distributional models for taxonomy construction
acquisition and modeling of categorical structure and modeling human category acquisition
constructing topic categorization systems and subject hierarchies
constructing hierarchical faceted metadata structures
methods for transforming semi-structured knowledge resources into taxonomies
merging and aligning existing resources for taxonomy construction
comparing, aligning and evaluating existing hierarchical structures
domain glossary acquisition and extracting taxonomies from definitions
constructing application/domain specific taxonomies from existing resources (lexical resources,Linked Open Data, Wikipedia category structure, semantic networks)
using different hierarchical structures (e.g., tree, DAG) and relation types (e.g., hyponymy, meronymy) for taxonomy construction
attaching Named Entities to hierarchical structures and using Named Entities to drive taxonomy construction by extensional analysis
multilinguality and taxonomies: constructing and using multilingual taxonomies
Other CFPs
- First Workshop on Computational Approaches to Code Switching
- Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation
- International Workshop on Language Technology for Closely Related Languages and Language Variants
- Workshop on Modeling Large Scale Social Interaction in Massively Open Online Courses
- The 34th Annual Meeting of the Israel Orthopaedic Association (IOA) & The 2nd Conference of the Mediterranean Trauma Society
Last modified: 2014-05-01 22:17:48