Train LoRA or even just fine-tuning Phi 3-5

Provide OpenAI-compatible endpoint for streaming