Changelog for KNIME Analytics Platform 3.3

Changes between v3.3.1 and v3.3.2

Release date: April 07, 2017

Enhancement

  • AP-6983: REST nodes should gracefully handle illegal JSON with BOM header
  • AP-6959: Excel Reader: New option "Skip hidden columns"
  • AP-6947: Upgrade Marvin Extension to 3.1.0.v0506
  • AP-6862: Fix error message for File Upload quickform when using AP view
  • AP-6861: File Upload node timeout node description improvement
  • AP-6809: Text Mining: export tokenizer in Textprocessing plugin
  • AP-6752: Add system property to prevent reading external entities by XML Reader
  • AP-6651: (API) Add NodeID#isRoot() method
  • AP-6214: Database writer with new option to fail on error
  • AP-7015: Create File Name node should push flowvariable during configure
  • BD-340: (Big Data Extensions): Detect Simba-based Hive JDBC driver provided by Hortonworks
  • BD-277: (Big Data Extensions): Add option for Kerberos debugging in Kerberos preferences
  • BD-251: (Big Data Extensions): Automatic user impersonation for Hive and Impala connections with Simba-based JDBC drivers on KNIME Server

Bug Fixes

  • AP-7084: XML Writer node doesn't work on server any more
  • AP-7083: JSON Writer node doesn't work on server any more
  • AP-7079: List Files doesn't work on server any more
  • AP-7078: Image Column Writer doesn't work on server any more
  • AP-7018: Line Reader with invalid default values
  • AP-6954: Excel Reader (XLS) will not read sheets if there are empty cells with comments
  • AP-6915: JavaSnippet: Missing support for PNG to String conversion
  • AP-6865: DeepLearning4J: Image Input broken for deprecated DeepLearning4J Learner Node
  • AP-6864: New Excel Writer fails on option AutoSize Column
  • AP-6851: Java Snippet: runtime classloader too permissive -- possibly loading classes from different bundle
  • AP-7188: Excel Reader sometimes doesn't work properly when run on server
  • AP-7176: Wrong replace column selected when opening Rule Engine node first time after loading workflow
  • AP-7105: Text Mining: the bounds of the min max values of the Frequency Filter node are too limiting
  • AP-7101: Text Mining: Tagger nodes fail when input does not contain document column
  • AP-7081: Smiles Reader doesn't work on server
  • AP-7080: Molfile Reader doesn't work on server
  • AP-7058: Workflow coach doesn't work behind proxies
  • AP-7046: JavaScript Parallel Coordinates does not fade lines when subscribing to selection events
  • AP-7041: Text Mining: Markup Filter needs tokenizer selection
  • AP-7040: Column Filter (PMML) produces NullPointerException
  • AP-7035: Excel Reader stuck parsing for certain (sparse) files
  • AP-7033: Text Mining: Default normalization of Spanish tokenizer needs to be switched off
  • AP-7014: Create file Name node cannot overwrite an existing variable
  • AP-7012: Java Snippet adds entry to import section if collection of collections column is selected
  • AP-7010: Data first loaded with SandboxedNodeCreator is inaccessible afterwards
  • AP-6994: Enrichment Plotter: The discovery rates and enrichment factors are all 0 if the smallest fraction contains less then one row
  • AP-6956: Database preview panel does not set NodeContext
  • AP-6937: Workflow Coach: No recommendations shown when empty workflow is opened
  • AP-6936: Node description of column filter quickform wrongly claims the output table is empty (no rows)
  • AP-6897: Impossible to pick new file in Amazon S3 File Picker if a file was already selected
  • AP-6882: Text Mining: Number Filter node not catching all numbers (e.g. decimal numbers with thousand separators)
  • AP-6875: Excel Writer / Linux: Opening file after writing in native editor may cause KNIME to crash
  • AP-6871: Text Mining: Strings to Document fails on empty table
  • AP-6870: (New) Excel Reader node has confusing row IDs
  • AP-6868: JavaScript Conditional BoxPlot fails if not enough values present
  • AP-6852: Difficult to select an outport with some connector geometries
  • AP-6822: Closing KNIME while starting up (reading node repository) may cause KNIME to hang
  • AP-6812: JavaScript Decision Tree View: Abbreviate long strings
  • AP-6760: JavaScript Decision Tree View: Fix various sizing problems
  • AP-6732: Problems with optional parameters of Openbabel node
  • BD-366: (Big Data Extensions): Connection to Spark Jobserver does not work behind proxy
  • BD-345: (Big Data Extensions): Hive/Impala loader and Spark reader/writer nodes do not store flow variable button state
  • BD-344: (Big Data Extensions):     
    Memory leak in Spark driver caused by Hive/Database to Spark and Spark to Hive/Database nodes
  • BD-331: (Big Data Extensions): Collaborative filtering does not use lambda parameter
  • BD-309: (Big Data Extensions): Second output of Spark Partitioning node lacks rows when input data contains duplicate rows
  • BD-300: (Big Data Extensions): Fix display and logging of Spark errors
  • BD-297: (Big Data Extensions): Spark jobs don't delete temporarily uploaded files on Spark Jobserver
  • PE-41: Call Local Workflow node doesn't work on headless servers
  • PE-40: Call Local Workflow node doesn't work on server any more
  • PE-37: Templates based on Wrapped Metanode must not include 'dialog configuration'

Changes between v3.3.0 and v3.3.1

Release date: December 21, 2016

Enhancement

  • AP-6794: Add Aggregation methods for SQL Server
  • AP-6820: Timeout of the FileUpload node should be configurable

Bug Fixes

  • AP-6819: Java Snippet is not configurable if configured input column is missing
  • AP-6805: JavaSnippet: List of deprecated DateAndTimeCell not convertible
  • AP-6770: Win32 installer broken - tries to access 64bit registry entries
  • AP-6763: File Reader Configuration dialog with unusable layout on some Windows systems
  • AP-6762: XLS reports cannot be generated any more
  • AP-6815: Java Snippet node throws FileNotFound exception when running in test flow with open dialog enabled
  • AP-6810: Normalize tags for German and Spanish NER models
  • AP-6799: JavaScript Decision Tree View: Fix Layout Issue
  • AP-6793: JavaScript Parallel Coordinates view does not open on Windows
  • AP-6791: Saved StringsToDocuments node cannot be loaded
  • AP-6761: KNIME SDK: Runtime instance possibly not starting unless "knime.enable.fastload" is set
  • AP-3317: Java Snippet: better "BinaryObject" support via byte[]

Changes between v3.2 and v3.3

(see highlight summary)

Release date: December 6, 2016

New nodes

  • AP-4783: Math Formula (Variable)
  • AP-3654: String Manipulation (Variable)
  • AP-5566: Database Table Creator
  • AP-6265: Azure Blob Storage Connector
  • AP-6047: Amazon S3 Connector
  • AP-6056: MSSQL Connector
  • AP-5990: Create Filename
  • AP-6120: Text Mining: Tika Language Detection
  • AP-6080: Text Mining: Tika Parser URL Input (integration with Apache TIKA)
  • AP-5910: Text Mining: Tika Parser (integration with Apache TIKA)
  • AP-6510: Text Mining: Lemmatizer
  • AP-6450: Text Mining: Diacritis Remover node
  • AP-6435: Text Mining: RSS Feed Reader
  • AP-6427: Text Mining: Document Vector Adapter
  • AP-6422: Text Mining: Stanford NE Model Scorer
  • AP-6421: Text Mining: Stanford Tagger
  • AP-6046: Text Mining: Document Vector Hashing
  • AP-6042: Text Mining: Entropy Node
  • AP-6117: Text Mining: NER Learner node (Standford NLP)
  • AP-6532: Quickform Slider Input
  • AP-6298: Range Slider Filter Definition
  • AP-6734: Filter Apply Row Splitter node
  • AP-6624: Filter definition apply node
  • AP-6623: Filter definition merger node
  • AP-6151: JavaScript Parallel Coordinates View
  • AP-6090: JavaScript Decision Tree View
  • AP-6215: DeepLearning4J: DeepLearning4J Feedforward Learner (split up of previous all-in-one learner)
  • AP-6091: DeepLearning4J: Regression Learner
  • AP-6295: Extract Missing Value Cause (node that extracts the error message from a missing cell, if present)
  • BD-231: Big Data Extensions: Spark AVRO file reader/writer nodes
  • BD-230: Big Data Extensions: Spark JSON file reader/writer nodes
  • BD-171: Big Data Extensions: Spark Text file reader/writer nodes
  • BD-167: Big Data Extensions: Spark Parquet file reader/writer nodes
  • BD-166: Big Data Extensions: Spark ORC file reader/writer nodes
  • BD-165: Big Data Extensions: Spark CSV file reader/writer nodes
  • BD-136: Big Data Extensions: Spark to Database
  • BD-135: Big Data Extensions: Database to Spark node
  • BD-74: Big Data Extensions: Impala to Spark and Spark to Impala node
  • BD-245: Big Data Extensions: Spark SQL Query

Enhancement

  • AP-5984: Upgrade to Eclipse 4.6
  • AP-6031: Curved connections between nodes in workflow editor
  • AP-6019: Searching for a column name in the Table View by substring
  • AP-1198: XLS Reader/Writer nodes to use different data model (scales significantly better for large tables/xls files)
  • AP-5415: XLS Writer to warn if user writes old 'xls' format (suggests 'xlsx' instead)
  • AP-4901: XLS Appender should give a warning during configure if the selected file does not exist
  • AP-4612: XLS Reader Should use strings not integers for column range specification via flow variables.
  • AP-6169: XLS Reader option to read cached values instead of evaluating formulas
  • AP-6606: Update Marvin Extension (kindly contributed by ChemAxon/Infocom) to ver3.1.0.v0505
  • AP-6243: Add support for a HELM cell type
  • AP-6247: PMML Model Appender: support for regression tree models
  • AP-6260: Feature Selection loop with variables containing current iteration and column name
  • AP-5900: SAS Reader: Update parso library to 2.0.2
  • AP-6736: (API) #configure method in JfcGenericBaseNodeModel to be no longer final -- requested by community
  • AP-6730: Buttons for different mouse modes on JavaScript Scatter Plot
  • AP-6713: Rserve receiving buffer size limit should accept 0 as input
  • AP-6711: Python nodes: "Execute Selection" action to run all selected lines (as opposed to selected text only)
  • AP-6704: Adjust tokenizer description appearence in Preference Page
  • AP-6703: Add tokenizer selection in Node Dialog for Strings To Document Node
  • AP-6701: Add tokenizer selection in Node Dialog for RSS Feed Reader
  • AP-6664: JSON Path node: allow changing the order of output columns
  • AP-6642: Default memory assignment to be 2GB for 64bit archs (-Xmx2G)
  • AP-6622: New port type for filter definitions
  • AP-6621: Use location flow variables values in reader dialogs
  • AP-6617: Static textprocessing files should be extracted into separate bundle
  • AP-6607: Generate useful dates in test data generator
  • AP-6582: Database Query node is configurable even if input spec is missing
  • AP-6579: Define flow variables with type and default in Generic JavaScript View dialog
  • AP-6542: Upgrade Generic JavaScript jQuery version to 3.x
  • AP-6531: Prepare REST extension for NTLM authentication
  • AP-6521: Text Mining: Nodes with tokenizer settings in dialog need to validate selected tokenizer
  • AP-6507: Add option to JS table view to format date/time strings in utc format
  • AP-6477: Add timeout option to Excel/XLS Reader
  • AP-6473: Update Selenium to 3.0
  • AP-6472: Update all PhantomJS binaries to 2.1
  • AP-6451: (API) New 'FilterHandler' handler on data columns -- needed to enable interactive filtering in JS views/webportal/guided analytics
  • AP-6446: Add possibility to output flow variable to Generic JavaScript view
  • AP-6408: XLS Writer/Appender node to automatically determine file ending (less picky when the user is lazy and omits '.xlsx')
  • AP-6403: Views on Wrapped Metanodes (delegated from contained JS view nodes)
  • AP-6393: Convert settings from JavaScript Line Plot to service menu
  • AP-6392: Convert settings from JavaScript Pie Chart to service menu
  • AP-6391: Convert settings from JavaScript Bar Chart to service menu
  • AP-6390: Convert settings from JavaScript Parallel Coordinates Plot to service menu
  • AP-6388: Convert settings from JavaScript Lift Chart to service menu
  • AP-6387: Convert settings from JavaScript ROC Curve to service menu
  • AP-6386: Convert settings from JavaScript Conditional Box Plot to service menu
  • AP-6385: Convert settings from JavaScript Box Plot to service menu
  • AP-6369: Add fullscreen support to JavaScript scatter plot and paged table
  • AP-6367: Create option to toggle JavaScript views in fullscreen mode in KNIME service
  • AP-6366: Convert settings from JavaScript scatter plot to service menu
  • AP-6365: Create menu in KNIME service JavaScript framework
  • AP-6360: List Box Quickform needs size option
  • AP-6355: (API): New dialog component: DialogComponentDuration
  • AP-6332: File location input fields in dialogs should resize with dialog
  • AP-6325: Subscribe to selection events in JavaScript scatter plot
  • AP-6324: Publish selection events for JavaScript scatter plot
  • AP-6323: Subscribe to filter events in JavaScript scatter plot
  • AP-6303: Subscribe to filter events in JavaScript paged table
  • AP-6302: Publish selection events for JavaScript paged table view
  • AP-6301: Subscribe to selection events in JavaScript paged table
  • AP-6299: Option for JavaScript paged table to switch visibility by selected rows
  • AP-6297: Write framework component to be included in each iframe (pub/sub)
  • AP-6246: JSON Patch node should keep input unchanged if "test" operation fails
  • AP-6245: Table to JSON node should have option to ignore missing cells
  • AP-6228: Split KNIME GUI-preference page
  • AP-6226: Add node connection properties to workflow editor settings dialog (and replace the grid setting dialog by it)
  • AP-6216: Kaplan-Meier Estimator should generate a view with the Kaplan-Meier Estimator Plot
  • AP-6439: DeepLearning4J: Upgrade to version 0.6.0
  • AP-6088: DeepLearning4J: Beautify Learner Node Dialog
  • AP-6376: DeepLearning4J: Remove massive logging
  • AP-6375: DeepLearning4J: Split Predictor Nodes
  • AP-6250: DeepLearning4J: Enhance performance of RowConverters
  • AP-6337: DeepLearning4J: New metanode for Deep Belief networks
  • AP-6189: DeepLearning4J: Potential KNIME crash on Windows caused by DLL conflicts
  • AP-6187: DeepLearning4J Learner: NPE when no layers are added to the network configuration
  • AP-6186: DeepLearning4J Learner: spec does not contain output layer in unsupervised learning
  • AP-6197: Wilcoxon Signed-Rank node: Additional statistics
  • AP-6185: Ability to ensure test workflow version for backwards compatibility testing
  • AP-6176: Upgrade Generic JavaScript D3 version to 4.x
  • AP-6134: Missing Values to be more prominently highlighted in table view (red) and should show error cause, if available
  • AP-6133: Expand Vector Node: use operator internals in streaming implementation
  • AP-6129: New workspaces with more comprehensive set of example workflows
  • AP-6119: Text Mining: Document Viewer - switch html interpretation on/off
  • AP-6118: Text Mining: Document Viewer - show tags (values and types)
  • AP-6116: Text Mining: Additional Tokenizers
  • AP-6043: Text Mining: IDF - normalization options
  • AP-5853: Text Mining: Strings to Document should not require title or author cols
  • AP-5815: Windows Installer should force uninstallation of previous installed versions
  • AP-5724: Add option for soft voting in Tree Ensemble and Random Forest predictor
  • AP-5710: Prediction Fusion Node: Add button "Add classes from column"
  • AP-5539: Text Mining: Strings 2 Document node should have column selection for publication date in dialog
  • AP-5514: k-means node to have option "enable hilite translation" to map between 2nd output (cluster center) and input
  • AP-4744: Unify file access checks in reader nodes
  • AP-4648: File handling nodes should support 'knime:' URIs (local only)
  • AP-4258: Writer nodes should add missing file extension in dialog
  • AP-4119: Add support for type converters in Java Snippet node
  • AP-3954: Tree Ensemble Learner to have options for stratified & equal size sampling
  • AP-3876: Writer node dialogs should be closable when location is controlled by flow variable
  • BD-244: (Big Data Extensions): Common remote (file) connector dialog should test connection (e.g. in HDFS)
  • BD-237: (Big Data Extensions): Improved Spark job error handling
  • BD-236: (Big Data Extensions): SSL support for HDFS/webHDFS/httpFS Connector nodes
  • BD-207: (Big Data Extensions): Create Spark Context node with new setting to hide "Context already exists" warning

Bug Fixes

  • AP-6705: Document Grabber no longer working with PubMed (PubMed API change)
  • AP-6611: PMML to RuleSet fails with "division undefined" error
  • AP-6605: String to Number doesn't support more than 2 billion rows
  • AP-6587: Security problem with cached db connections on KNIME server with Kerberos authentication
  • AP-6575: Gradient Boosted Trees Predictor fails in streaming mode
  • AP-6563: LibSVM predictor shuffles probabilities when table contains unseen classes
  • AP-6534: PNG images are blurred when shown in table view
  • AP-6533: Tooltip in "Memory" tab is not showing actual cell count threshold (problem reported in forum)
  • AP-6525: R Integration: Too many R Process Cleanup Threads during shutdown
  • AP-6523: R Integration: Rserve Processes are not terminated
  • AP-6492: CSV File History: Invalid entries may prevent table view from showing (thanks to menu entry 'export csv')
  • AP-6430: SQLServer VARBINARY type should not be read as string
  • AP-6419: Python Nodes: Error when using unicode characters in flow variables
  • AP-6407: REST nodes set wrong Content-Encoding
  • AP-6380: Python integration: "Execute failed: Column XXX has an unsupported type float"
  • AP-6356: Simple Streaming: Wrapped metanode appears to lock-up when it contains already executed nodes (should refuse execution instead)
  • AP-6351: File Meta info node fails with NPE for files on server
  • AP-6257: Association Rule Learner (Borgelt) shows negative RuleLift%
  • AP-6193: Potential deadlock under MacOS during startup
  • AP-6586: DataType.getName() method does not distinguish between SetCell and ListCell
  • AP-6484: Autocomplete Quickform view does not load
  • AP-6470: PhantomJS does not start under MacOS Sierra
  • AP-6143: License View doesn't open after update from AP 3.1 to AP 3.2
  • AP-6729: Group loops don't work with more than 100,000 groups
  • AP-6721: Equals and hashCode for MissingCells are not consistent
  • AP-6708: Joiner node fails to configure on compatible types (error: Type mismatch found of Joining Column Pair "Molecule (SMILES)" and "Molecule (SMILES)".This causes an empty output table.)
  • AP-6690: Save-As of workflows does not update mount point root in context
  • AP-6669: Delay spinner in GET Request node is disabled when workflow is re-opened
  • AP-6663: Java Snippet nodes use wrong encoding under Windows
  • AP-6661: Deserialization of JSONDataTable does not work
  • AP-6657: Testflow configuration node is not reset in streaming tests
  • AP-6645: Modular PMML nodes have incomprehensive error messages
  • AP-6641: Wrapping/unwrapping of meta nodes removes the node description
  • AP-6639: XPath dialog cannot identify tags in mixed text/tag lines
  • AP-6610: Long labels are cut off in Quickform Slider
  • AP-6590: Deserialization of JSONDataTableSpec does not work
  • AP-6578: DatabaseConnectionSettings should synchronize statement execution
  • AP-6574: JavaScript Scatter Plot: Selection is not retained after value is applied
  • AP-6572: Spark nodes stop working when JPMML nodes are installed
  • AP-6571: Spark nodes stop working when REST nodes are installed
  • AP-6568: Possible NPE when opening workflow on Linux with GTK 3+
  • AP-6522: Dragging PMML file onto workbench brings up deprecated PMML reader (expecting most recent reader)
  • AP-6506: Set look and feel in correct thread on startup
  • AP-6475: Random Forest nodes fail with NPE if a multiclass dataset contains missing values
  • AP-6442: Selection in javascript scatterplot is flaky
  • AP-6405: Javascript Scatter Plot: Changing columns for plot removes selection
  • AP-6395: JavaScript views should be assigned an aspect ratio based resize method in the layout by default
  • AP-6361: SAS Reader reads fields as Missing Values even though they exist
  • AP-6343: Wrapped metanode does not show outport names in tooltip
  • AP-6333: <first sheet with data> does not load first sheet's data to the File content table in XLS Reader
  • AP-6305: JSON Path node must not be executable without path expression
  • AP-5980: SPARQL Insert node cannot be interrupted
  • AP-5969: Streamed wrapped node fails on credentials flow variable in input
  • AP-5873: Selection in JS-Scatterplot not working on Linux
  • AP-5837: Fixed Width File Reader only allows selecting XML files in file chooser
  • AP-5682: Excel Writer writes file into installation directory when no path is provided
  • AP-5540: Text Mining: Strings 2 Document, adds "-" to author names if no first or lastname is specified
  • AP-5370: Image generated with JavaScript Line Plot displays x-axis labels in wrong order on Windows when a string column is selected
  • AP-5011: XLS Reader uses confusing row reference
  • AP-2658: XL Reader is off by one second
  • AP-2562: Some file browse dialogs need two clicks to gain focus
  • BD-272: (Big Data Extensions): Resetting Spark nodes throws exception if Spark context is closed
  • BD-271: (Big Data Extensions): Spark PMML Model Predictor throws exception when dialog is opened
  • BD-269: (Big Data Extensions): Spark Java Snippet node has problem if Spark version of incoming Spark context changes
  • BD-268: (Big Data Extensions): Table to Spark node requires too much memory for large tables
  • BD-256: (Big Data Extensions): Spark Job Server handles missing job results due to large jobs or job queue overflow more gracefully
  • BD-250: (Big Data Extensions): NullPointerException when destroying Spark context right after creation
  • BD-239: (Big Data Extensions): Create Spark Context node throws IllegalArgumentException when using Credentials Input
  • BD-235: (Big Data Extensions): XML syntax error in HiveLoad node description
  • BD-233: (Big Data Extensions): Fix null value handling in Fetch Row Spark Job
  • BD-227: (Big Data Extensions): Wait until remote spark context gets removed
  • BD-222: (Big Data Extensions): Destroy context with context per vm enabled causes job server to no longer accept new contexts
  • BD-219: (Big Data Extensions): Occasional error in spark extension: Execute failed: No Spark jobs for Spark version 1.6 found.
  • BD-187: (Big Data Extensions): HDFS file browsing is rooted at user home directory
  • BD-173: (Big Data Extensions): Spark Predictor with random forests model fails when jobserver is configured with jvm-per-context=true