Functionality

From Clairlib
Jump to: navigation, search

Functionality

Native to Clairlib-Core are are tokenization, summarization, LexRank, biased LexRank, document clustering, document indexing, PageRank, biased PageRank, web graph analysis, network generation, power law distribution analysis, network analysis (clustering coefficient, degree distribution plotting, average shortest path, diameter, triangles, shortest path matrices, connected components), cosine similarity, random walks on graphs, statistics (distributions, tests), tf, idf, perceptron learning and classification, phrase- based retrieval and fuzzy-OR queries.

Native to Clairlib-Ext are, additionally, an interface with Weka, a Java-based machine learning toolkit, and latent semantic indexing (LSI).

Functionality imported into Clairlib-Core includes stemming, sentence segmentation, web page download, web crawling, XML parsing, XML tree building, XML writing.

Modules

Browse module documentation: http://clairlib.org/pdoc/

Clairlib includes the following modules:

  • Clair::ALE::_SQL
  • Clair::ALE::Conn
  • Clair::ALE::Default::NormalizeURL
  • Clair::ALE::Default::Stemmer
  • Clair::ALE::Default::Tokenizer
  • Clair::ALE::Extract
  • Clair::ALE::Link
  • Clair::ALE::NormalizeURL
  • Clair::ALE::Search
  • Clair::ALE::Stemmer
  • Clair::ALE::Tokenizer
  • Clair::ALE::URL
  • Clair::ALE::Wget
  • Clair::ALE::Wget::Hash
  • Clair::Algorithm::LSI
  • Clair::Bio::Connection
  • Clair::Bio::EUtils
  • Clair::Bio::EUtils::ESearch
  • Clair::Bio::EUtils::ESearchHandler
  • Clair::Bio::GeneRIF
  • Clair::Centroid
  • Clair::CIDR
  • Clair::CIDR::Wrapper
  • Clair::Classify
  • Clair::Cluster
  • Clair::Config
  • Clair::Corpus
  • Clair::Debug
  • Clair::Document
  • Clair::Extensions
  • Clair::Features
  • Clair::Gen
  • Clair::GenericDoc
  • Clair::GenericDoc::html
  • Clair::GenericDoc::octet_stream
  • Clair::GenericDoc::plain
  • Clair::GenericDoc::shakespear
  • Clair::GenericDoc::sports
  • Clair::GenericDoc::xml
  • Clair::GraphWrapper
  • Clair::GraphWrapper::Boost
  • Clair::IDF
  • Clair::Index
  • Clair::Index::dirfiles
  • Clair::Index::mldbm
  • Clair::Info::Query
  • Clair::Info::Stats
  • Clair::Interface::Weka
  • Clair::Learn
  • Clair::LinkPolicy::BarabasiAlbert
  • Clair::LinkPolicy::ErdosRenyi
  • Clair::LinkPolicy::LinkPolicyBase
  • Clair::LinkPolicy::MenczerMacro
  • Clair::LinkPolicy::MenczerPAMixed
  • Clair::LinkPolicy::RadevMicro
  • Clair::LinkPolicy::RadevPAMixed
  • Clair::LinkPolicy::WattsStrogatz
  • Clair::MEAD::DocsentConverter
  • Clair::MEAD::Summary
  • Clair::MEAD::Wrapper
  • Clair::Network
  • Clair::Network::Centrality
  • Clair::Network::Centrality::Betweenness
  • Clair::Network::Centrality::Closeness
  • Clair::Network::Centrality::Degree
  • Clair::Network::Generator::ErdosRenyi
  • Clair::Network::Generator::GeneratorBase
  • Clair::Network::Reader
  • Clair::Network::Reader::Edgelist
  • Clair::Network::Reader::GraphML
  • Clair::Network::Reader::Pajek
  • Clair::Network::Sample::ForestFire
  • Clair::Network::Sample::RandomEdge
  • Clair::Network::Sample::RandomNode
  • Clair::Network::Sample::SampleBase
  • Clair::Network::Writer
  • Clair::Network::Writer::Edgelist
  • Clair::Network::Writer::GraphML
  • Clair::Network::Writer::Pajek
  • Clair::NetworkWrapper
  • Clair::Nutch::Search
  • Clair::Polisci::AU::XMLHandler
  • Clair::Polisci::AustralianParser
  • Clair::Polisci::Graf
  • Clair::Polisci::Record
  • Clair::Polisci::Speaker
  • Clair::Polisci::US::Connection
  • Clair::Polisci::US::XMLHandler
  • Clair::RandomDistribution::Gaussian
  • Clair::RandomDistribution::LogNormal
  • Clair::RandomDistribution::Poisson
  • Clair::RandomDistribution::RandomDistributionBase
  • Clair::RandomDistribution::RandomDistributionFromWeights
  • Clair::RandomDistribution::Zipfian
  • Clair::SentenceFeatures
  • Clair::SentenceSegmenter::MxTerminator
  • Clair::SentenceSegmenter::SentenceSegmenter
  • Clair::SentenceSegmenter::Text
  • Clair::Statistics::Distributions::DistBase
  • Clair::Statistics::Distributions::Geometric
  • Clair::Statistics::Distributions::TDist
  • Clair::StringManip
  • Clair::SyntheticCollection
  • Clair::Util
  • Clair::Utils::ALE
  • Clair::Utils::CorpusDownload
  • Clair::Utils::Idf
  • Clair::Utils::LinearAlgebra
  • Clair::Utils::MxTerminator
  • Clair::Utils::Parse
  • Clair::Utils::porter
  • Clair::Utils::Robot2
  • Clair::Utils::Stem
  • Clair::Utils::Tf
  • Clair::Utils::TFIDFUtils
  • Clair::Utils::WebSearch
Personal tools
Namespaces

Variants
Actions
Main Menu
Documentation
Clairlib Lab
Community
Development
Toolbox