Functionality
From CLAIRlib
Functionality
Native to Clairlib-Core are are tokenization, summarization, LexRank, biased LexRank, document clustering, document indexing, PageRank, biased PageRank, web graph analysis, network generation, power law distribution analysis, network analysis (clustering coefficient, degree distribution plotting, average shortest path, diameter, triangles, shortest path matrices, connected components), cosine similarity, random walks on graphs, statistics (distributions, tests), tf, idf, perceptron learning and classification, phrase- based retrieval and fuzzy-OR queries.
Native to Clairlib-Ext are, additionally, an interface with Weka, a Java-based machine learning toolkit, and latent semantic indexing (LSI).
Functionality imported into Clairlib-Core includes stemming, sentence segmentation, web page download, web crawling, XML parsing, XML tree building, XML writing.
Modules
Browse module documentation: http://clair.si.umich.edu/clair/clairlib/pdoc/
Clairlib includes the following modules:
- Clair::ALE::_SQL
- Clair::ALE::Conn
- Clair::ALE::Default::NormalizeURL
- Clair::ALE::Default::Stemmer
- Clair::ALE::Default::Tokenizer
- Clair::ALE::Extract
- Clair::ALE::Link
- Clair::ALE::NormalizeURL
- Clair::ALE::Search
- Clair::ALE::Stemmer
- Clair::ALE::Tokenizer
- Clair::ALE::URL
- Clair::ALE::Wget
- Clair::ALE::Wget::Hash
- Clair::Algorithm::LSI
- Clair::Bio::Connection
- Clair::Bio::EUtils
- Clair::Bio::EUtils::ESearch
- Clair::Bio::EUtils::ESearchHandler
- Clair::Bio::GeneRIF
- Clair::Centroid
- Clair::CIDR
- Clair::CIDR::Wrapper
- Clair::Classify
- Clair::Cluster
- Clair::Config
- Clair::Corpus
- Clair::Debug
- Clair::Document
- Clair::Extensions
- Clair::Features
- Clair::Gen
- Clair::GenericDoc
- Clair::GenericDoc::html
- Clair::GenericDoc::octet_stream
- Clair::GenericDoc::plain
- Clair::GenericDoc::shakespear
- Clair::GenericDoc::sports
- Clair::GenericDoc::xml
- Clair::GraphWrapper
- Clair::GraphWrapper::Boost
- Clair::IDF
- Clair::Index
- Clair::Index::dirfiles
- Clair::Index::mldbm
- Clair::Info::Query
- Clair::Info::Stats
- Clair::Interface::Weka
- Clair::Learn
- Clair::LinkPolicy::BarabasiAlbert
- Clair::LinkPolicy::ErdosRenyi
- Clair::LinkPolicy::LinkPolicyBase
- Clair::LinkPolicy::MenczerMacro
- Clair::LinkPolicy::MenczerPAMixed
- Clair::LinkPolicy::RadevMicro
- Clair::LinkPolicy::RadevPAMixed
- Clair::LinkPolicy::WattsStrogatz
- Clair::MEAD::DocsentConverter
- Clair::MEAD::Summary
- Clair::MEAD::Wrapper
- Clair::Network
- Clair::Network::Centrality
- Clair::Network::Centrality::Betweenness
- Clair::Network::Centrality::Closeness
- Clair::Network::Centrality::Degree
- Clair::Network::Generator::ErdosRenyi
- Clair::Network::Generator::GeneratorBase
- Clair::Network::Reader
- Clair::Network::Reader::Edgelist
- Clair::Network::Reader::GraphML
- Clair::Network::Reader::Pajek
- Clair::Network::Sample::ForestFire
- Clair::Network::Sample::RandomEdge
- Clair::Network::Sample::RandomNode
- Clair::Network::Sample::SampleBase
- Clair::Network::Writer
- Clair::Network::Writer::Edgelist
- Clair::Network::Writer::GraphML
- Clair::Network::Writer::Pajek
- Clair::NetworkWrapper
- Clair::Nutch::Search
- Clair::Polisci::AU::XMLHandler
- Clair::Polisci::AustralianParser
- Clair::Polisci::Graf
- Clair::Polisci::Record
- Clair::Polisci::Speaker
- Clair::Polisci::US::Connection
- Clair::Polisci::US::XMLHandler
- Clair::RandomDistribution::Gaussian
- Clair::RandomDistribution::LogNormal
- Clair::RandomDistribution::Poisson
- Clair::RandomDistribution::RandomDistributionBase
- Clair::RandomDistribution::RandomDistributionFromWeights
- Clair::RandomDistribution::Zipfian
- Clair::SentenceFeatures
- Clair::SentenceSegmenter::MxTerminator
- Clair::SentenceSegmenter::SentenceSegmenter
- Clair::SentenceSegmenter::Text
- Clair::Statistics::Distributions::DistBase
- Clair::Statistics::Distributions::Geometric
- Clair::Statistics::Distributions::TDist
- Clair::StringManip
- Clair::SyntheticCollection
- Clair::Util
- Clair::Utils::ALE
- Clair::Utils::CorpusDownload
- Clair::Utils::Idf
- Clair::Utils::LinearAlgebra
- Clair::Utils::MxTerminator
- Clair::Utils::Parse
- Clair::Utils::porter
- Clair::Utils::Robot2
- Clair::Utils::Stem
- Clair::Utils::Tf
- Clair::Utils::TFIDFUtils
- Clair::Utils::WebSearch

