Data Preparation for Machine-Learning-Based Static Code Analysis