Palladian

 

Palladian is a Java-based toolkit which provides functionality to perform typical Internet Information Retrieval tasks. It provides a collection of algorithms for text processing focused on classification, extraction of various types of information, and retrieval. The aim of Palladian is to reuse algorithms that are freely available and build upon them to drive research by providing unified interfaces. This way, new algorithms can be quickly compared to the state-of-the-art allowing other users to create more advanced programs in the future.
 
The growing collection of Palladian KNIME nodes provide the possibility to use Palladian’s capabilities directly within KNIME, to complement and extend existing workflows, or to allow for quick prototyping without having to write any code. The current version features the following nodes:
 
  • TextClassifierLearner and TextClassifierPredictor
  • PalladianNer
  • HttpRetriever
  • NekoHtmlParser
  • WebSearcher
  • RankingServices
  • FeedDiscovery
  • FeedParser
  • ContentExtractor
  • DateExtractor
  • RMSECalculator
  • APCalculator
  • RatioCalculator
  • DatasetReader
 
Installation instructions for the nodes can be found here: http://tech.knime.org/community-installation
More information about the Palladian toolkit is available here: http://palladian.ws/
If you have any questions, comments, or problems, we are happy to hear from you: mail@palladian.ws