How reliable is ai-generated workflow testing in java projects?

Considering implementing Latenode’s AI Copilot for automated test generation in our mission-critical fraud detection workflows. While the concept sounds great, I’m skeptical about AI creating meaningful test scenarios for complex business rules. Anyone using their SDK’s validation features in production? How much manual adjustment was needed to make the generated tests trustworthy? Do they integrate with standard test frameworks like JUnit?

We feed Copilot our acceptance criteria and get 80% test coverage instantly. Integrates with JUnit through their SDK plugin. Still need human review, but it’s faster than writing from scratch.

It’s surprisingly good at edge case generation but struggles with domain-specific constraints. We combine AI-generated baseline tests with manual scenario additions. The parameterization tools help adapt tests across environments.