The BetaML.Nn Module

BetaML.Nn — Module

BetaML.Nn module

Implement the functionality required to define an artificial Neural Network, train it with data, forecast data and assess its performances.

Common type of layers and optimisation algorithms are already provided, but you can define your own ones subclassing respectively the AbstractLayer and OptimisationAlgorithm abstract types.

The module provide the following types or functions. Use ?[type or function] to access their full signature and detailed documentation:

Model definition:

DenseLayer: Classical feed-forward layer with user-defined activation function
DenseNoBiasLayer: Classical layer without the bias parameter
VectorFunctionLayer: Layer whose activation function run over the ensable of its nodes rather than on each one individually. No learnable weigths on input, optional learnable weigths as parameters of the activation function.
ScalarFunctionLayer: Layer whose activation function run over each node individually, like a classic DenseLayer, but with no learnable weigths on input and optional learnable weigths as parameters of the activation function.
ReplicatorLayer: Alias for a ScalarFunctionLayer with no learnable parameters and identity as activation function
ReshaperLayer: Reshape the output of a layer (or the input data) to the shape needed for the next one
PoolingLayer: In the middle between VectorFunctionLayer and ScalarFunctionLayer, it applyes a function to the set of nodes defined in a sliding kernel. Weightless.
ConvLayer: A generic N+1 (channels) dimensional convolutional layer
GroupedLayer: To stack several layers into a single layer, e.g. for multi-branches networks
NeuralNetworkEstimator: Build the chained network and define a cost function

Each layer can use a default activation function, one of the functions provided in the Utils module (relu, tanh, softmax,...) or one provided by you. BetaML will try to recognise if it is a "known" function for which it sets the exact derivatives, otherwise you can normally provide the layer with it. If the derivative of the activation function is not provided (either manually or automatically), AD will be used and training may be slower, altought this difference tends to vanish with bigger datasets.

You can alternativly implement your own layer defining a new type as subtype of the abstract type AbstractLayer. Each user-implemented layer must define the following methods:

A suitable constructor
forward(layer,x)
backward(layer,x,next_gradient)
get_params(layer)
get_gradient(layer,x,next_gradient)
set_params!(layer,w)
size(layer)

Model fitting:

fit!(nn,X,Y): fitting function
fitting_info(nn): Default callback function during fitting
SGD: The classical optimisation algorithm
ADAM: A faster moment-based optimisation algorithm

To define your own optimisation algorithm define a subtype of OptimisationAlgorithm and implement the function single_update!(θ,▽;opt_alg) and eventually init_optalg!(⋅) specific for it.

Model predictions and assessment:

predict(nn) or predict(nn,X): Return the output given the data

While high-level functions operating on the dataset expect it to be in the standard format (nrecords × ndimensions matrices) it is customary to represent the chain of a neural network as a flow of column vectors, so all low-level operations (operating on a single datapoint) expect both the input and the output as a column vector.

The BetaML.Nn Module

Module Index

Detailed API