Sarah Kim
@sarah-k · 23h ago
Questions
Llama 2 vs GPT-4: Output Quality Issues
Has anyone else noticed a significant inconsistency in output quality between open-source LLMs like Llama 2 70B and proprietary models like GPT-4 Turbo, particularly regarding generating detailed user stories with accurate task breakdowns – I’m seeing ~30% lower success rates with the open-source models when prompted the same way?