Creating, Managing, and Understanding Large, Sparse, Multitask Neural Networks

H Sikka (2020)

Apr 1, 2020

One popular direction in Deep Learning (DL) research has been to build larger and more complex deep networks that perform well on several different learning tasks, an approach commonly known as multitask learning. This work is usually done within a specific domain, e.g. multitask models that perform captioning, translation, and text classification. Some work has also been done on building multimodal/crossmodal networks that combine different neural network primitives (convolutional layers, recurrent layers, Mixture-of-Experts layers, etc.). This paper surveys topics and ideas that may prove relevant to large, sparse, multitask networks (LSMNs) and examines the potential for a general approach to building and managing them. A framework to automatically build, update, and interpret modular LSMNs is presented in the context of current tooling and theory.

Full Preprint found here.

Written by Manifold Team
