Choosing activation functions for multilayer networks