Link Search Menu Expand Document

Setup

Table of contents

  1. Preliminary steps
  2. TGrep2 and TDTlite setup
  3. Checklist

These setup instructions are specific to Stanford affiliates. You will access both TGrep2 and TDTlite via AFS. For information on how to set up the TGrep2/TDT pipeline at a different institution, email Judith Degen at jdegen@stanford.edu. (Documentation forthcoming.)

Preliminary steps

TGrep2 and TDTlite require perl and Python.

To use TGrep2, you’ll need to log onto a server with an AFS mount. Cardinal works as of April 2023.

Before you can access the tools, you must ask the corpus TA to add your SUNet ID to the corpora-general group. Go here to see who is the current corpus TA. Once you have been added to the group, following the remaining instructions.

TGrep2 and TDTlite setup

Log onto a server. In your terminal, type:

ssh SUNET@cardinal.stanford.edu

Enter your password at the prompt and complete 2-factor authentication. You will be transported to your home directory.

Add the following to your bash profile:

export PATH=$PATH:~/.gem/bin:/afs/ir/data/linguistic-data/bin/linux_2_4:/afs/ir/data/linguistic-data/TDTlite:~/bin

export TGREP2ABLE=/afs/ir/data/linguistic-data/Treebank/tgrep2able/

export TDTlite=/afs/ir/data/linguistic-data/TDTlite/

export TGREP2_CORPUS=$TGREP2ABLE/swbd.t2c.gz

export TDT_DATABASES=/afs/ir/data/linguistic-data/TDTlite/databases/

If you’re not sure how to access your bash profile:

  1. At the prompt, type:

vi .bash_profile

  1. To get into insert mode, type the letter i. Copy and paste the above content into the very start of the file. Make sure to press Enter after you have pasted to make sure there is a line separating the pasted content from any following content.

  2. Save and exit as follows: Press the Esc key. Enter a colon, then the letters wq and press Enter.

Source the bash profile:

source .bash_profile

Now you should be able to access TGrep2 and the TDT. Typing tgrep2 at the prompt should print usage information for TGrep2. Typing run -h at the prompt should print usage information for TDTlite. You’re all set! Check out the tutorials for how to get started.

Checklist

Here is a recap of all the things that should be set up on the server.

  • Your SUNet has been added to corpora-general by the corpus TA.
  • You have set the relevant environment variables in your bash profile.