Main Page

From Clairlib

Jump to: navigation, search

The Clair library is a suite of open-source Perl modules intended to simplify a number of generic tasks in natural language processing (NLP), information retrieval (IR), and network analysis (NA). Its architecture also allows for external software to be plugged in with very little effort.

  • Code - Clairlib comprises over 100 modules covering functionality for a wide range of tasks
  • Documentation - full API documentation in PDF and HTML format

[edit] Getting Started

  • Installation - instructions for installing Perl and Clairlib
  • Getting Started - how to get up and running, including trying out Clairlib's examples

[edit] Latest Version: Clairlib-Core 1.05 [July 2008]

  • Fixed formatting bugs in CorpusDownload.pm
  • Added get_predecessor_matrix() function in Network.pm
  • Added get_shortest_path() function in Network.pm
  • Added erase_corpus.pl script
  • added erase_isolated_nodes.pl script
  • added --ignore-isolated-nodes in convert_network.pl
  • added several options to print_network_stats.pl: a. --self-loop,
  • completed the descriptions of print_network_stats.pl: added the note of --force into usage.
  • added sentence_to_docs.pl , lines_to_docs.pl under util folder


For full details see the Development page.

[edit] More about Clairlib

  • Contribute - ways to contribute to Clairlib
  • Development - learn about Clairlib development
  • FAQ - answers to frequently asked questions
  • Clairlib-dev - mailing list for discussion among Clairlib developers (and users)
  • People - Clairlib developers and contributors
  • Presentation - an introduction to Clairlib (from October 2006)
  • Projects - ideas for student projects using Clairlib
  • NLP - Wikipedia entry on natural language processing
  • If you publish using clairlib, you should acknowledge its creators. Please use the following bibtex:
 @techreport{Radev&al.07a,
 author =	 "Radev, Dragomir R. and Hodges, Mark and Fader,
                 Anthony and Joseph, Mark and Gerrish, Joshua and
                 Schaller, Mark and dePeri, Jonathan and Gibson,
                 Bryan",
 title =	 "CLAIRLIB Documentation v1.03",
 institution =	 "University of Michigan. Department of Electrical
                 Engineering and Computer Science",
 pdf =
                 "http://tangra.si.umich.edu/~radev/papers/csetr536-07.pdf",
 postscript =
                 "http://tangra.si.umich.edu/~radev/papers/csetr536-07.ps",
 papernum =	 "124",
 year =	 "2007",
 number =	 "CSE-TR-536-07",
 x-category =	 "CLAIR,RADEV,MISC"
Personal tools