# Procedure Configuration

A procedure configuration object is used to create or load a procedure.

It is a JSON object that looks like this:

{
"id": <id>,
"type": <type>,
"params": {
<params>
}
}

• id is a string that defines the URL at which the procedure will be available via the REST API
• type is a string that specified the procedure's type (see below)
• params is an object that configures the procedure, and whose contents will vary according to the type

Not all three of these fields are required in all contexts:

• one or both of id and type must be specified
• if only id is specified, MLDB will assume this is a pre-existing procedure and will try to load it (an error will ensue if it doesn't already exist)
• if type is specified, MLDB will assume that the procedure doesn't exist yet and will try to create it (an error will ensue if it already exists)
• if type is specified without id, an id will be auto-generated
• if type is specified with id, the procedure will be created with the specified id unless a procedure already exists with that id
• if type is specified, then a corresponding params function must be specified if the type requires it

The following types of procedures are available:

TypeDescriptionDoc
classifier.experimentTrain and test a classifier[doc]
classifier.testCalculate the accuracy of a classifier on held-out data[doc]
classifier.trainTrain a supervised classifier[doc]
export.csvExports a dataset to a target location as a CSV[doc]
import.gitImport a Git repository's metadata into MLDB[doc]
import.jsonImport a text file with one JSON per line into MLDB[doc]
import.sentiwordnetImport a SentiWordNet file into MLDB[doc]
import.textImport from a text file, line by line.[doc]
import.word2vecImport a word2vec file into MLDB[doc]
kmeans.trainSimple clustering algorithm based on cluster centroids in embedding space[doc]
meltPerforms a melt operation on a dataset[doc]
mongodb.importImport a dataset from MongoDB[doc]
permuter.runRun a child procedure with permutations of its configuration[doc]
probabilizer.trainTrains a model to calibrate a score into a probability[doc]
randomforest.binary.trainTrain a supervised binary random forest[doc]
statsTable.bagOfWords.trainCreate statistical tables of trials against outcomes for bag of words[doc]
statsTable.trainCreate statistical tables of trials against outcomes[doc]
summary.statisticsCreates a dataset with summary statistics for each columns of an input dataset[doc]
svd.trainTrain a SVD to convert rows or columns to embedding coordinates[doc]
tfidf.trainPrepare data for a TF-IDF function[doc]
transformApply an SQL expression over a dataset to transform into another dataset[doc]
tsne.trainProject a high dimensional space into a low-dimensional space suitable for visualization[doc]