Bots From Extension: cfxai_clustering
CloudFabrix ML - Clustering
This extension provides 4 bots.
Bot @cfxusml:input-validation
Bot Position In Pipeline: Sink
CFX ML: Validate input for log clustering and save the report to the dataset named by 'result'
This bot expects a Restricted CFXQL.
Each parameter may be specified using the '=' operator, and multiple parameters may be combined with the logical AND operation.
The following parameters are expected for this bot:
Parameter Name | Type | Default Value | Description |
---|---|---|---|
columns | Text | | Comma-separated list of columns to be used for clustering. Default 'description' or 'message' |
result | Text | clustering_input_validation_result | Name of dataset that holds the input validation results |
fail_on | Text | | Fail on Errors or Warnings or Info if needed |
Bot @cfxusml:logclustering
Bot Position In Pipeline: Sink
CFX ML: Perform log clustering and save the model as 'model_name'
This bot expects a Restricted CFXQL.
Each parameter may be specified using the '=' operator, and multiple parameters may be combined with the logical AND operation.
The following parameters are expected for this bot:
Parameter Name | Type | Default Value | Description |
---|---|---|---|
columns | Text | | Comma-separated list of columns to be used for clustering. Default 'description' or 'message' |
min_cluster_size | Text | 50 | Minimum cluster size. Default is 50. |
min_sample_size | Text | 1 | Minimum sample size. Default is 1 |
model_name | Text | default | Name of the model for trained data. Default model name is 'default' |
devariablise_patterns | Text | | Regex patterns for de-variabilisation of entities from the cluster columns. Should be passed as a dictionary |
remove_vars | Text | yes | De-variabilise cluster columns. Allowed values 'yes' or 'no' |
entityid_column | Text | | Column with a unique ID per row, used to update existing/trained rows with the cluster label. Pass this when the pipeline is executed from OIA/AIA with file upload or with sources other than the default |
job_name | Text | default | Name of the Job to be created. Default job name is 'default' |
skip_errors | Text | no | Specify 'yes' or 'no'. If 'yes', do not bail out if processing results in an error. Check the 'reason' field when the bot continues after an error. |
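The parameter rule stated above ('=' per parameter, combined with AND) can be illustrated with a small Python helper that renders a parameter string for `@cfxusml:logclustering`. This is only a sketch: the helper name `render_bot_call`, the single-quoting of every value, and the lowercase `and` joiner are assumptions made for illustration, and the exact Restricted CFXQL syntax should be checked against the product documentation.

```python
def render_bot_call(bot: str, params: dict) -> str:
    """Render a bot name followed by 'name = value' clauses joined by AND.

    Assumptions (not confirmed by this page): values are single-quoted
    and clauses are joined with the keyword 'and'.
    """
    clauses = []
    for name, value in params.items():
        text = str(value)
        if "'" in text:
            # Escaping rules are unknown here, so refuse rather than guess.
            raise ValueError(f"single quote not supported in value for {name!r}")
        clauses.append(f"{name} = '{text}'")
    if not clauses:
        return bot
    return bot + " " + " and ".join(clauses)


# Parameter names and values below come from the table above.
call = render_bot_call(
    "@cfxusml:logclustering",
    {"columns": "message", "min_cluster_size": 50, "model_name": "default"},
)
print(call)
```

Parameters left out of the dictionary are simply omitted from the rendered string, so the bot's documented defaults would apply to them.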
Bot @cfxusml:logpredict
Bot Position In Pipeline: Sink
CFX ML: log cluster prediction using pre-saved 'model_name'
This bot expects a Restricted CFXQL.
Each parameter may be specified using the '=' operator, and multiple parameters may be combined with the logical AND operation.
The following parameters are expected for this bot:
Parameter Name | Type | Default Value | Description |
---|---|---|---|
columns | Text | | Comma-separated list of columns. Default 'description' or 'message' |
model_name | Text | default | Name of the model for trained data. Default model name is 'default' |
model_path | Text | default | MinIO path for the trained model. By default, the model is looked up in the tmp directory |
model_meta_dataset | Text | default | Model meta dataset name. Must contain the column modelPath |
remove_vars | Text | yes | De-variabilize cluster columns. Allowed values 'yes' or 'no' |
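As an illustration of how the defaults and allowed values in the table above fit together, the following Python sketch merges caller overrides with the documented defaults for `@cfxusml:logpredict` and rejects values the table does not allow. The function name `resolve_logpredict_params` and the idea of checking parameters on the client side are hypothetical; the bot itself may validate differently.

```python
# Defaults copied from the parameter table above; 'columns' has no
# single default value listed, so it is left out of this mapping.
LOGPREDICT_DEFAULTS = {
    "model_name": "default",
    "model_path": "default",
    "model_meta_dataset": "default",
    "remove_vars": "yes",
}


def resolve_logpredict_params(overrides: dict) -> dict:
    """Merge overrides with documented defaults and run basic checks."""
    known = set(LOGPREDICT_DEFAULTS) | {"columns"}
    unknown = set(overrides) - known
    if unknown:
        raise ValueError(f"unknown parameter(s): {sorted(unknown)}")
    params = {**LOGPREDICT_DEFAULTS, **overrides}
    # The table lists 'yes' and 'no' as the only allowed values.
    if params["remove_vars"] not in ("yes", "no"):
        raise ValueError("remove_vars must be 'yes' or 'no'")
    return params


params = resolve_logpredict_params({"columns": "message", "remove_vars": "no"})
print(params)
```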
Bot @cfxusml:output-validation
Bot Position In Pipeline: Sink
CFX ML: Validate output of log clustering and save the report to the dataset named by 'result'
This bot expects a Restricted CFXQL.
Each parameter may be specified using the '=' operator, and multiple parameters may be combined with the logical AND operation.
The following parameters are expected for this bot:
Parameter Name | Type | Default Value | Description |
---|---|---|---|
columns | Text | | Comma-separated list of columns to be used for clustering. Default 'description' or 'message' |
result | Text | clustering_output_validation_result | Name of dataset that holds the output validation results |
fail_on | Text | | Fail on Errors or Warnings or Info if needed |