TRACER

Visualising results in a tabular format


Last updated 7 years ago


The .score files produced by TRACER can be transformed into a more human-readable format with a script that prints the corresponding text segment next to the reuse ID, as well as the number of overlapping features and the percentage of similarity. In the terminal, navigate to the TRACER main folder and type:

java -cp tracer.jar eu.etrap.tracer.postprocessing.DefaultOutputterMain /.../filename.txt /.../filename.score

Where:

  • /.../filename.txt is the path to the main text file;

  • /.../filename.score is the score file.
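When many score files need expanding, the command above can be wrapped in a small script. The following is a minimal Python sketch, not part of TRACER: the helper names `build_outputter_command` and `run_outputter` are hypothetical, and it assumes that `java` is on the PATH and that `tracer.jar` sits in the TRACER main folder.

```python
import subprocess

# Class name taken from the command shown above.
OUTPUTTER_CLASS = "eu.etrap.tracer.postprocessing.DefaultOutputterMain"


def build_outputter_command(text_path, score_path, jar="tracer.jar"):
    """Return the argument list for the expansion command (hypothetical helper)."""
    return ["java", "-cp", jar, OUTPUTTER_CLASS, str(text_path), str(score_path)]


def run_outputter(tracer_dir, text_path, score_path):
    """Run the command from the TRACER main folder; raises if java exits with an error."""
    cmd = build_outputter_command(text_path, score_path)
    return subprocess.run(cmd, cwd=tracer_dir, check=True)
```

Because the command is executed from the TRACER main folder, it is safest to pass absolute paths for the text file and the score file.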

The generated file, accessible from /data/corpora/myfolder/, has .expanded as its suffix (i.e., filename.score.expanded) and contains six columns separated by tabs:

RUID 1, RUID 2, FEATUREOVERLAP, SIMTHRESHOLD, TEXT 1, TEXT 2

This tab-separated text output can be copied and imported into a .csv file for the gold standard analysis.
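The conversion to .csv can also be scripted. The sketch below is one possible approach, not a TRACER utility: the function name `expanded_to_csv` is hypothetical, and it assumes that the .expanded file is UTF-8 encoded, has no header row, and holds the six tab-separated columns listed above.

```python
import csv

# Column names as listed above; writing them as a header row is an assumption.
COLUMNS = ["RUID 1", "RUID 2", "FEATUREOVERLAP", "SIMTHRESHOLD", "TEXT 1", "TEXT 2"]


def expanded_to_csv(expanded_path, csv_path):
    """Copy a tab-separated .expanded file into a comma-separated file with a header.

    Returns the number of data rows written.
    """
    rows_written = 0
    with open(expanded_path, encoding="utf-8", newline="") as src, \
         open(csv_path, "w", encoding="utf-8", newline="") as dst:
        # QUOTE_NONE keeps quotation marks inside the text segments as literal characters.
        reader = csv.reader(src, delimiter="\t", quoting=csv.QUOTE_NONE)
        writer = csv.writer(dst)
        writer.writerow(COLUMNS)
        for row in reader:
            if not row:  # skip blank lines
                continue
            writer.writerow(row)
            rows_written += 1
    return rows_written
```

The original .expanded file is left untouched, so the .csv copy can be annotated freely during the gold standard analysis.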