Reinforcement learning for mixed-integer problems based on MPC