Developing RDA Bots
1. Prerequisites
Developer must have following environments:
- Linux OS or MacOS
- Python 3 Runtime environment
- Docker Installed on the development system
- Install
rdac.py
as per instructions from this page
Bots can be developed in many programming languages. Bot implementation to RDAF interaction happens through language neutral messaging built on Protcol Buffers.
Following is support matrix and road map:
Language | Runtime Version | Availability |
---|---|---|
Python | 3.7 | Available Now |
Python | 3.9, 3.10 | Available Now |
Go | 1.18.3 | Available Now |
Java | August 2022 |
If you are looking for additional languages or runtime versions, or want some help during development, Join our Slack Community and let us know.
2. Bot Definition File
Bots are defined in YAML format. Following is a simple example Bot Package definition file. Going forward, this file will be referred to as bots.yml
bots.yml
file has three major sections: extension
, runtime
(Optional) and bots
.
2.1 extension
Section in bots.yml
extension
section is mandatory. Following are the parameters in extension
section:
Parameter |
Mandatory |
Description |
---|---|---|
namespace |
Yes | Namespace should identify developer's company or project uniquely. Namespace must be all lowercase characters may contain digits. If you wish to publish the developed bot packages to RDA Marketplace, CloudFabrix would issue a certificate which can be used to sign the bot package to identify the origin of the package. |
type |
Yes | type parameter identifies name of the package and must be unique within the specified namespace. Must be all lowercase letters and may contain digits. |
version |
Yes | Version of the Bot Package. RDA uses date based versioning convention. Date based convention format is YY.MM.DD . If the package was changed on June 10th 2022, version would be 22.6.10 . Any versioning format is accepted. RDA determines latest version by sorting alphabetically. |
description |
Yes | Descriptive details about the Bot Package. |
default_name |
A name to be used as prefix for the bot name. At the time of deploying bot package, user can choose any unique name. | |
publisher |
Yes | Name of the company or developer. |
support_email |
Email address to contact for any support related to this Bot Package | |
repo_url |
Git or any other publicly accessible URL which may provide additional information for the Bot Package, issue tracking and additional documentation | |
implementation |
Yes (if runtime is python) | This parameter must be a simple object with one parameter: code . The code parameter must specify the main python file which contains Bot implementations. All the implementation files must be inside the directory code which must be at the level as bots.yml This field is ignored if runtime is not python. |
config_template |
Configuration template for the bot package. This is required only if certain credentials are required for this package to work. For Example if the bot package pulls data from a certain service using APIs, and if that service needs credentials, format of the credentials must be defined in this object. See below for more details on config_template |
2.2 config_template
Details
Let us assume, the Bot Package needs to connect to a specific server & port. Let us say, the server expects authentication for APIs.
Configuration template will look something like below:
config_template | |
---|---|
In this template, parameters expected from user are hostname
, port
, api_token
, timeout
, verify
. All the parameters that start with $
are special purpose tags that define behavior or attach metadata.
$secure
: All the parameters listed under this tag are encrypted when stored. Also, user input would be masked.$mandatory
: All parameters listed under this tag must have a valid value.$labels
: This tag is object which provides parameter name to display label. If such a mapping is not specified, parameter name is used as label.
2.3 runtime
section in bots.yml
runtime
section in bots.yml
is an optional section that is used to specify the languange and other runtime environment specific to this extension.
Parameter | Mandatory | Description |
---|---|---|
type |
optional | Language and version in which bot extension runs on. Currently supported values are python3.7 , python3.9 , python3.10 and go1.18 , Default is python3.7 |
packages |
optional | Any python packages required to be installed for bot extension to run successfully. All the packages should be pegged to a version using ==. Applicable only if runtime type is pythonxxx. Ignored for other runtimes. |
2.4 bots
section in bots.yml
bots
section is mandatory. Bots must be a list of objects. At least one bot is required.
Each bot object will have following parameters.
Parameter | Mandatory | Description |
---|---|---|
name |
yes | Name of the bot. Must be unique within the bot package. Must be all lowercase. Must start with a letter and may contain hyphens and digits. |
description |
Yes | Description of the bot |
bot_type |
Yes | Valid values for bot source , source-any , sink , sink-updates .source : Bot outputs data. Can only be used at the beginning of a Pipeline Block. source-any : Bot outputs data. Can be used at any position in the pipeline. Does not consume input data. sink : Bot consumes data but does not modify it. sink-updates : Bot consumes input data and produces updated data |
model_type |
Yes | Only allowed value for this parameter is api . Future versions of Bot Package Development Kits will support additional types |
model_parameters |
model_parameters is a list of objects. Each object must contain following parametersname : Parameter name. Recommended to use letters, digits and underscore.description : Description for the parameter type : Valid values are text or datetime . Future versions of Bot Package Development Kit will support additional data types and enums. mandatory : Must be true or false . Declares whether the specified by name is mandatory |
3. Bot Implementation Code
3.1 Bot Implementation Code (Python)
Following a simple working example code that works with bots.yml
specified above.
3.2 Bot Implementation Code (Go Lang)
The bot implementation in golang needs to conform to the following structure:
- The extension implementation code needs to be placed in
code/rda
directory relative tobots.yml
file. - Place a
go.mod
file in this directory and add any dependent packages in this file (optional) If the dependant packages are not included in this file, they are automatically installed during build time. - Create a file (ex. extension.go) in the directory and make sure it is declared as
pacakge rda
- This file should have the following functions implemented.
package rda
type Extension struct {
Config map[string]interface{}
ExtInfo map[string]interface{}
}
func NewExtension(extInfo map[string]interface{}, config map[string]interface{}) Extension {
// This function should instantiate Extension struct and return it.
}
func (p Extension) ShutdownHook() {
// Any code that needs to be executed when the extension is being shutdown
}
func (p Extension) ExecuteBot(botName string, botDef map[string]interface{}, args map[string]interface{}) (*[]map[string]interface{}, error) {
// This is the main function that executes a bot. botName is passed as an argument so that this function can switch based on botName
}
Following is a sample code for golang:
package rda
import (
"errors"
"fmt"
"io/ioutil"
"net/http"
"runtime"
"runtime/debug"
"github.com/sirikothe/gotextfsm"
)
type Extension struct {
Config map[string]interface{}
ExtInfo map[string]interface{}
}
func NewExtension(extInfo map[string]interface{}, config map[string]interface{}) Extension {
return *(new(Extension))
}
func (p Extension) ShutdownHook() {
fmt.Printf("Shutfown Hook Called")
}
func (p Extension) parseTextFsm(botDef map[string]interface{}, args map[string]interface{}) (*[]map[string]interface{}, error) {
queryParams := args["query_params"].(map[string]interface{})
inputColumn := ""
if queryParams["input_column"] != nil {
inputColumn = queryParams["input_column"].(string)
}
url := ""
if queryParams["url"] == nil {
return nil, errors.New("url is not passed as a param to the bot")
}
url = queryParams["url"].(string)
if args["input_data"] == nil {
return nil, errors.New("input_data is not passed to the bot")
}
resp, err := http.Get(url)
if err != nil {
return nil, err
}
if resp.StatusCode != http.StatusOK {
return nil, errors.New(fmt.Sprintf("Error while getting %s. Status: %d\n", url, resp.StatusCode))
}
body, err := ioutil.ReadAll(resp.Body)
if err != nil {
return nil, err
}
template := string(body)
fsm := gotextfsm.TextFSM{}
err = fsm.ParseString(template)
if err != nil {
return nil, err
}
parser := gotextfsm.ParserOutput{}
inputData := args["input_data"].([]map[string]interface{})
var output []map[string]interface{}
for i := 0; i < len(inputData); i++ {
row := inputData[i]
if row[inputColumn] != nil {
input := row[inputColumn].(string)
err = parser.ParseTextString(input, fsm, true)
if err != nil {
return nil, err
}
output = append(output, parser.Dict...)
}
}
return &output, nil
}
func (p Extension) getVersions(botDef map[string]interface{}, args map[string]interface{}) (*[]map[string]interface{}, error) {
bi, ok := debug.ReadBuildInfo()
if !ok {
return nil, errors.New("Failed to read build info")
}
var output []map[string]interface{}
output = append(output, map[string]interface{}{"path": "go", "version": runtime.Version()})
for _, dep := range bi.Deps {
output = append(output, map[string]interface{}{"path": dep.Path, "version": dep.Version})
}
return &output, nil
}
func (p Extension) ExecuteBot(botName string, botDef map[string]interface{}, args map[string]interface{}) (*[]map[string]interface{}, error) {
switch botName {
case "parse":
return p.parseTextFsm(botDef, args)
case "get-versions":
return p.getVersions(botDef, args)
default:
return nil, errors.New(fmt.Sprintf("Unknown bot %s. Don't know how to execute", botName))
}
}
4. Building and Publishing First Bot Package
Make sure to place bots.yml
and myfile.py
in the following directory structure:
Let us assume you have your bot package source code in ~/rda_bots/myfirst_bot/
Make sure to Install rdac.py
4.1 Build and Deploy Bot Package
Change directory
Do Build:
If no errors occur, push the package and code to RDA Fabric:
Verify that package is added to RDA Fabric:
Output should be similar to following:
Name Version Description Has Dist Publisher Saved Time
----------------- --------- ----------------------------------------- ---------- ----------- --------------------------
botpkg_acme_demo1 22.6.6.2 Example RDA Bot Package for demonstration Yes ACME Inc 2022-06-01T20:42:09.551631
If bot package is not needed, use rdac.py bot-package-delete
to delete it from RDA Fabric.
4.2 Add Credentials to Vault
Once Bot Package has beed deployed, we need to add necessary credentials to RDA Fabric Vault.
Verify that `botpkg_acme_demo`` is a valid Secret Type in RDAF:
Output should be like this:
Now add credentials to vault:
This command will prompt for inputs. Name of the credential is mandatory and must be unique within RDAF deployment. Name must be all lowercase with optional hyphens. If the Bot Package has any additional credential requirement such as username
and password
, they should defined in config_template
(Section 2.2).
Configure Integration: Example RDA Bot Package for demonstration
Name*: demo
Add to Default Site Profile: no
Verify that secret has been added to vault using rdac.py secret-list | grep botpkg
4.3 Test the Bots with a Pipeline
To test the first bot get-data
in this package, we can use following simple pipeline:
Save the above pipeline snippet as test_pipeline_1.rda
Run the pipeline:
Note
_ If you omit --pipeline test_pipeline_1.rda
option, above command will prompt for pipeline. You can copy paste the pipeline contents and type .
in a new line.
_ --vault
option instructs the Pipeline execution tool to fetch any required credentials from the Vault. * --attr debug=demo1
attaches an attribute called debug
with value `demo``. This will help is while observing logs or traces for this test.
Output should be similar to:
Created Job: {} 2788d2bec2f84004b070c7761e3ee05b
Running:
Running:
Completed:
Completed the job in 3.2 seconds
Purging output for jobid: 2788d2bec2f84004b070c7761e3ee05b
Output has 1 rows, 1 columns
+----+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| | output |
|----+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 0 | <p><h3>News that doesn't make you Snooze. <i>Generated by CloudFabrix RDA Bots</i></h3><ul>{%for row in rows%}<li><a href='{{ row.link }}'>{{ row.title }}</a></li>{% endfor %}</li></ul></p> |
+----+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```
=== "Example Traces"
` Host Pipeline JobID Seq Status Bot Dataframe Error Message ec5289f08e52 test_pipeline_20220610_171948 2788d2be 0 in-progress @demo:get-data 0x0 ec5289f08e52 test_pipeline_20220610_171948 2788d2be 1 in-progress 1x1 ec5289f08e52 test_pipeline_20220610_171948 2788d2be 2 completed 1x1 ec5289f08e52 test_pipeline_20220610_171948 2788d2be 3 trace 0x0 Saving output `
!!! note
Traces can be observed using `rdac.py watch-traces --attr debug=demo1`
This command keeps running until it is stopped using ++ctrl+c++