Changes between Version 1 and Version 2 of Multilingualtaxonomy


Ignore:
Timestamp:
11/25/09 13:10:15 (10 years ago)
Author:
sauermann
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Multilingualtaxonomy

    v1 v2  
    1 '''Multilingual taxonomy (Syria)''' 
    2  
    3  
     1=== Multilingual taxonomy (Syria) === 
    42Syria enquires about the taxonomy and whether this will be available in languages other then English, particularly Italian in their case. 
    53 
    64 
    7 '''RTD Response:''' 
     5=== RTD Response === 
     6A user can write whatever text she or he wants into the taxonomy. They can use terms in any language as long as drupal can store it (drupal can store most languages). The RecommenderComponent, and the TaxonomyLearningComponent do only support english at the moment (we are working on german in winter 2009), so the support for many languages is limited with the automated services. 
     7It requires a technical procedure of analysing a lingual corpus to develop a taxonomy. Currently the project is not aware of such work having taken place in Italian. 
    88 
    9 This is not certain, there are crucial elements to the technical architecture that rely on previous work done within the Semantic Web domain to analyse text. This functionality was primarily developed in English. It requires a technical procedure of analysing a lingual corpus to develop a taxonomy. Currently the project is not aware of such work having taken place in Italian. 
     9 * LeoSauermann says: manual tagging can be done in any language. Automated tagging in any language is a hard problem. It can take up a lot of time (many months, we are depending on open source language analyser modules by OpenNLP here) and money with uncertain outcome. If a serious amount of money (I guesstimate around 10.000-100.000 EUR) is spent on a commercial multilingual tagging software and integrating it, it may work (depends on the licensing scheme of the language software provider, such as Empolis, Texttech, Microsoft FAST, IBM LanguageWare, etc). In the priorities of this research project, from the RTD perspective, having some simple features working may have a higher priority than adding another language. 
    1010 
    11  
    12 '''End Result:''' --- 
     11=== End Result ===