
Multi-Token Prediction: A Promising New Idea For LLMs

Teaching LLMs to look farther into the future?
In recent years, the development of large language models has revolutionized natural language processing, enabling machines to generate human-like text and perform a wide range of language understanding tasks. However, these models still face challenges in generating coherent and contextually appropriate text, especially when it comes to long-range dependencies and decision-making at critical choice points. Enter multi-token prediction—a novel approach that promises to unlock the full potential of language models by considering longer contexts during training.

Next Token Prediction

The standard approach for training large language models is a next-token prediction loss: the model learns to predict the next token in a sequence given the previous tokens. The researchers behind the multi-token prediction work argue, however, that training models to predict several future tokens at once yields higher sample efficiency and better downstream performance.
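For concreteness, here is a minimal PyTorch sketch of that baseline objective, assuming a generic decoder-only `model` that maps token ids to per-position logits (the model and shapes are illustrative, not from the paper): the logits at each position are scored against the token one step ahead with cross-entropy.

```python
import torch
import torch.nn.functional as F

def next_token_loss(model, tokens):
    """Standard next-token prediction loss (illustrative sketch).

    tokens: LongTensor of shape (batch, seq_len); `model` is assumed to map
    token ids to logits of shape (batch, seq_len, vocab_size).
    """
    logits = model(tokens)                    # (B, T, V)
    pred = logits[:, :-1, :]                  # positions 0..T-2 predict ...
    target = tokens[:, 1:]                    # ... tokens 1..T-1
    return F.cross_entropy(
        pred.reshape(-1, pred.size(-1)),      # (B*(T-1), V)
        target.reshape(-1),                   # (B*(T-1),)
    )
```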

The Idea

The key idea is that at each position in the training corpus, instead of just predicting the next token, the model is tasked with predicting the following n tokens simultaneously using n independent output heads stacked on top of a shared model trunk. This multi-token prediction is treated as an auxiliary training task.
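A rough sketch of this setup (not the paper's exact implementation): put n lightweight output heads on a shared trunk and average the cross-entropy losses for offsets 1 through n. The `trunk`, `d_model`, `vocab_size`, and `n_future` names below are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiTokenHeads(nn.Module):
    """n independent output heads on top of a shared trunk (illustrative sketch)."""

    def __init__(self, trunk, d_model, vocab_size, n_future=4):
        super().__init__()
        self.trunk = trunk   # shared transformer trunk: token ids -> (B, T, d_model)
        self.heads = nn.ModuleList(
            [nn.Linear(d_model, vocab_size) for _ in range(n_future)]
        )
        self.n_future = n_future

    def loss(self, tokens):
        h = self.trunk(tokens)                # (B, T, d_model)
        losses = []
        for k, head in enumerate(self.heads, start=1):
            # Head k predicts the token k steps ahead of each position.
            logits = head(h[:, :-k, :])       # (B, T-k, V)
            target = tokens[:, k:]            # (B, T-k)
            losses.append(F.cross_entropy(
                logits.reshape(-1, logits.size(-1)),
                target.reshape(-1),
            ))
        # With n_future=1 this reduces to the usual next-token objective,
        # which is why the extra heads act as an auxiliary signal.
        return torch.stack(losses).mean()
```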
In their experiments on code and natural language models up to 13 billion parameters, the researchers found that multi-token prediction consistently outperformed the next-token prediction baselines on generative benchmarks, with models solving around 15% more code problems on average. The gains were increasingly pronounced for larger model sizes.

Added Efficiency

Moreover, the additional prediction heads trained via multi-token prediction can be leveraged at inference time to enable faster decoding using techniques like speculative decoding. Their best 4-token prediction model achieved up to 3x speedup without compromising quality.
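As a hedged illustration of how those extra heads can speed up decoding (one possible scheme, not necessarily the paper's exact one): all n heads draft a short block of future tokens from a single trunk pass, and a second pass verifies the drafts so the accepted tokens match what plain greedy next-token decoding would have produced. The `trunk`/`heads` interface is carried over from the training sketch above and assumes batch size 1.

```python
import torch

@torch.no_grad()
def greedy_self_speculative_decode(model, tokens, num_new_tokens=64):
    """Greedy decoding that uses the extra heads as drafters (illustrative sketch).

    Assumes `model.trunk(tokens) -> (1, T, d_model)` and `model.heads[k]`
    predicting the token k+1 positions ahead, as in the sketch above.
    """
    target_len = tokens.size(1) + num_new_tokens
    n = len(model.heads)
    while tokens.size(1) < target_len:
        # Draft pass: one trunk call at the current end proposes n future tokens.
        h_last = model.trunk(tokens)[:, -1, :]                   # (1, d_model)
        draft = torch.stack(
            [head(h_last).argmax(-1) for head in model.heads], dim=1
        )                                                        # (1, n)

        # Verification pass: extend the sequence with the first n-1 drafts and
        # read off, at each drafted position, what greedy decoding would emit.
        extended = torch.cat([tokens, draft[:, : n - 1]], dim=1)
        h_ext = model.trunk(extended)                            # (1, T+n-1, d_model)
        verify = model.heads[0](h_ext[:, tokens.size(1) - 1 :, :]).argmax(-1)  # (1, n)

        # verify[:, 0] equals draft[:, 0] by construction; keep later drafts
        # only while they agree with the verified greedy continuation.
        accepted = [draft[:, 0]]
        for k in range(1, n):
            if torch.equal(verify[:, k], draft[:, k]):
                accepted.append(draft[:, k])
            else:
                accepted.append(verify[:, k])   # the corrected greedy token
                break
        tokens = torch.cat([tokens] + [a.unsqueeze(1) for a in accepted], dim=1)
    return tokens[:, :target_len]
```

When most drafts are accepted, each pair of trunk passes yields several tokens instead of one, which is where this style of speculative decoding gets its speedup.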

Core Results

Training large language models to predict multiple future tokens at once (multi-token prediction) instead of just the next token results in better sample efficiency and improved downstream performance. On code benchmarks like MBPP and HumanEval, models trained with 4-token prediction solved around 15% more problems on average than next-token prediction baselines. The benefits grow with model size: the 13B-parameter multi-token model solved 12% more MBPP problems and 17% more HumanEval problems than a comparable next-token model. Multi-token prediction also enabled faster inference through techniques like speculative decoding. On natural language tasks such as summarization, multi-token models outperformed next-token baselines, though the gains were smaller than on code. Finally, controlled experiments showed that multi-token prediction helped smaller models acquire induction capabilities and generalize better on arithmetic tasks than simply scaling up the model.

Why Does this Work?

The researchers hypothesize that multi-token prediction encourages better transfer of information across positions and development of algorithmic reasoning capabilities. Through controlled experiments, they demonstrate that multi-token prediction indeed helps smaller models acquire induction heads and generalize better on arithmetic tasks compared to tripling the model size.
Overall, multi-token prediction is a simple yet effective modification that results in stronger and faster transformer language models at no additional training cost. The researchers hope this work will motivate further exploration of novel auxiliary losses beyond just next-token prediction.