Operator ToolboxOperator Toolbox

This extension couples some useful additional Operators together.

This extension adds a bunch of new Operators to RapidMiner.They range from utility Operators to improve the flexibility and usability of the process design over additional outlier detection algorithm and additional performance criteria to advanced analysis methods like Local Interpretation or the SMOTE algorithm.

Currently the extension provides the following Operators:

  • Data Access

    • Read Excel Sheet Names (new in 0.9.0)

  • Blending:

    • Group Into Collection (improved in 0.9.0)

    • Merge

    • Collect and Persist

    • Generate Phonetic Encoding

    • Generate Levenshtein Distance

    • Weight of Evidence

    • Extract Statistics

    • Replace Rare Values

    • Smote Upsampling

  • Data Generator:

    • Create ExampleSet (improved in 0.9.0)

    • Generate Univariate Series

  • Outliers:

    • Tukey Test

  • Macros:

    • Extract Last Modifying Operator

    • Set Macros from ExampleSet

  • Parameters:

    • Get Parameters

    • Set Parameters from ExampleSet

  • Models:

    • Dictionary Based Sentiment (improved in 0.9.0)

    • Apply Dictionary Based Sentiment (improved in 0.9.0)

    • Get Decision Tree Path

    • Get Local Interpretation

    • Parametric Probability Estimator

    • Random Forest Encoder (new in 0.9.0)

  • Text Processing:

    • Filter Tokens using ExampleSet

    • Split Document into Collection

    • Stem Tokens using ExampleSet

  • Performance

    • Performance (AUPRC)

Version 0.9.0 (2018-02-02)

  • New Operator Read Excel Sheet Names
  • New Operator Random Forest Encoder
  • Improved Dictionary Based Sentiment
    • It can now handle negation words by using a 'negation' dictionary
  • Improved Group Into Collection
    • Additional option to sort the Collection
  • Improved Create ExampleSet
    • Additional option to trim attribute names
    • Added Meta Data handling

Version 0.8.0 (2017-12-22)

  • New Operator Replace Rare Values
  • New Operator Merge
  • Improved Operator Parameteric Probability Estimator
    • Automatic capabilities check
    • Added calculation of pValues for each new Attribute. pValues are provided as AttributeWeights by a new weigths output port
    • Added possibility to apply a threshold on this pValues, so that only matching Attributes are created
  • Enhancements: Improved documentation and tutorial processes, Create ExampleSet uses now nominal as value type for text-input data

Version 0.7.0 (2017-11-28)

  • New Operator Parameteric Probability Estimator
  • Enhancement: Apply Dictionary Based Sentiment: Added number of positive, negative and the total number of tokens
  • Bugfix: Group Into Collection: Fixed a bug with weird entries for special Attributes.

Version 0.6.1 (2017-10-27)

  • New Operator Smote Upsampling

  • New Operator Generate Univariate Series

    • Combine functionality of Generate Date Series (now deprecated) with generating a numerical series

  • Bugfix: Group Into Collection: Special roles are now correctly set

  • Enhancement: Generate ExampleSet: option to parse all data as NOMINAL

Version 0.5.0 (2017-08-21)

  • New Operator Extract Statistics

  • Work on Get Local Interpretation:

    • New inner performance port, if the performance of the inner model is calculated and connected to this port it is added to the corresponding Example

    • Bugfix that the Weight Threshold was not used.

Version 0.4.1 (2017-07-27)

  • Stem Tokens Using ExampleSet

  • Weight of Evidence

  • Split Document into Collection

  • Dictionary Based Sentiment

  • Apply Dictionary Based Sentiment

  • Performance (AUPRC)

  • Additional Changes:

    • improved documentation of Tukey Test

    • Bugfix Create ExampleSet and Get Local Interpretation

    • Additional OutputPort for Get Local Interpretation

Version 0.3.0 (2017-05-18)

  • Create ExampleSet

  • Set Parameters from ExampleSet

  • Set Macros from ExampleSet

  • Get Local Interpretation

  • Collect and Persist

Version 0.2.0 (2017-03-23)

  • Get Parameters

  • Get Decision Tree Path

  • Generate Date Series

Version 0.1.0 (2017-02-16)

  • Group Into Collection

  • Tukey Test

  • Generate Levenshtein Distance

  • Generate Phonetic Encoding

  • Get Last Modifying Operator

Product Details

Version 0.9.0
File size 4.6 MB
Downloads 12317 (58 Today)12317 downloads
Vendor RapidMiner Labs
Category Operators
Released 2/2/18
Last Update 2/2/18 1:18 PM
License AGPL
Product web site
Rating 0.0 stars(0)