Master thesis: Partially Unsupervised Deep Meta-Reinforcement Learning