    AnlamVer Dataset
    
    
    
    Word Similarity and Relatedness Dataset for Turkish. See the paper "AnlamVer: Semantic Model Evaluation Dataset for Turkish - Similarity and Relatedness" for details.
    
    
    
    Downloads
    
        Download Final annotated dataset: anlamver-final.cvs 
        
        Download Individual scores of each annotator: anlamver-participants.cvs
        
        
        This dataset was annotated using the open-source software WSQuest.
    
 
    
    Column Names
    
    
    
    
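      | Column Abbr. | Column Name | Note |
      |---|---|---|
      | QID | QuestionID | |
      | W1 | Word1 | |
      | W2 | Word2 | |
      | Sim | Similarity | Participants' average |
      | Rel | Relatedness | Participants' average |
      | S | Similar | Falls in the similar sub-space of the Sim-Rel vector space |
      | D | Dissimilar | Falls in the dissimilar sub-space of the Sim-Rel vector space |
      | R | Related | Falls in the related sub-space of the Sim-Rel vector space |
      | U | Unrelated | Falls in the unrelated sub-space of the Sim-Rel vector space |
      | SR | SimilarRelated | Falls in the similar-related sub-space of the Sim-Rel vector space |
      | DR | DissimilarRelated | Falls in the dissimilar-related sub-space of the Sim-Rel vector space |
      | SU | SimilarUnrelated | Falls in the similar-unrelated sub-space of the Sim-Rel vector space |
      | DU | DissimilarUnrelated | Falls in the dissimilar-unrelated sub-space of the Sim-Rel vector space |
      | AVG-C | Average concreteness | Individual concreteness values are from the TKN dataset |
      | W1F | Word1 frequency | Frequency values are based on the Boun Corpus |
      | W2F | Word2 frequency | Frequency values are based on the Boun Corpus |
      | AnyOOV | Any out-of-vocabulary (OOV) word exists | OOV values are based on the Boun Corpus |
      | Two | Both words are OOV | OOV values are based on the Boun Corpus |
      | EstSyn | EstimatedSynonym | Word pair estimated as a synonym relation type before the annotation |
      | EstAny | EstimatedAntonym | Word pair estimated as an antonym relation type before the annotation |
      | EstRHigh | EstimatedHighRelatedness | Word pair estimated as a high-relatedness relation type before the annotation |
      | EstRMed | EstimatedMediumRelatedness | Word pair estimated as a medium-relatedness relation type before the annotation |
      | EstRLow | EstimatedLowRelatedness | Word pair estimated as a low-relatedness relation type before the annotation |
      | EstHyp | EstimatedHyponym | Word pair estimated as a hyponym relation type before the annotation |
      | EstMer | EstimatedMeronym | Word pair estimated as a meronym relation type before the annotation |
      | W1-RWG | RareWord (RW) group of word1 | See the paper for RW groups. RW groups are assigned by word frequency values. |
      | W2-RWG | RareWord (RW) group of word2 | See the paper for RW groups. RW groups are assigned by word frequency values. |
      | RWMin | Minimum RW group of the two words in the word pair | See the paper for RW groups. RW groups are assigned by word frequency values. |
      | W1-DG | Derivational group of word1 | Value represents how many derivations the word has |
      | W2-DG | Derivational group of word2 | Value represents how many derivations the word has |
      | DGMax | Max of derivational groups | Max(W1-DG, W2-DG) |
      | W1-IG | Inflectional group of word1 | Value represents how many inflections the word has |
      | W2-IG | Inflectional group of word2 | Value represents how many inflections the word has |
      | IGMax | Max of inflectional groups | Max(W1-IG, W2-IG) |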
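
    As a rough illustration of how these columns might be read, the Python sketch below parses a small sample and checks the DGMax and IGMax definitions given in the table. The sample rows are invented placeholders, not values from the dataset, and the comma delimiter, the header spelling, and the helper names load_pairs and max_columns_consistent are assumptions made for this example only.

```python
import csv
import io

# Hypothetical two-row sample using a subset of the documented columns.
# The values are invented placeholders, not rows from the real dataset,
# and the comma delimiter is an assumption about the file format.
SAMPLE = """QID,W1,W2,Sim,Rel,W1-DG,W2-DG,DGMax,W1-IG,W2-IG,IGMax
1,kelime1,kelime2,7.5,8.0,0,1,1,0,0,0
2,kelime3,kelime4,1.0,6.5,2,0,2,1,0,1
"""

FLOAT_COLUMNS = ("Sim", "Rel")
INT_COLUMNS = ("W1-DG", "W2-DG", "DGMax", "W1-IG", "W2-IG", "IGMax")


def load_pairs(handle, delimiter=","):
    """Read word pairs into dicts, converting score and group columns to numbers."""
    rows = []
    for row in csv.DictReader(handle, delimiter=delimiter):
        for name in FLOAT_COLUMNS:
            row[name] = float(row[name])
        for name in INT_COLUMNS:
            row[name] = int(row[name])
        rows.append(row)
    return rows


def max_columns_consistent(row):
    """Check DGMax = max(W1-DG, W2-DG) and IGMax = max(W1-IG, W2-IG)."""
    return (row["DGMax"] == max(row["W1-DG"], row["W2-DG"])
            and row["IGMax"] == max(row["W1-IG"], row["W2-IG"]))


pairs = load_pairs(io.StringIO(SAMPLE))
for pair in pairs:
    print(pair["QID"], pair["W1"], pair["W2"], max_columns_consistent(pair))
```

    To try this on the downloaded file, an open file handle can be passed to load_pairs in place of the in-memory sample; the delimiter and text encoding of the real file may differ from what is assumed here.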
    
    
    
     
    
    
    Cite
    If you use this resource in your research, please cite the following paper: