Required Skills & Experience
--Strong hands-on experience in UI test automation and API test automation
--Proficient in designing, building, and maintaining scalable automation frameworks
--Comfortable working across multiple testing frameworks and toolchains
--Deep understanding of core automation principles including test design patterns, maintainability, reliability, and CI/CD integration
--Ability to evaluate, adapt, and implement automation solutions regardless of framework or technology stack
--Hands‑on ability to build AI‑driven testing and verification workflows
--Proficiency with Python and SQL (Postgres or SQL Server)
--Experience testing complex, business‑critical systems (ERP, finance, data‑heavy platforms)
--Comfort working with autonomous agents rather than manual testing
Nice to Have Skills & Experience
--Experience using Playwright for AI‑driven UI exploration
--Familiarity with LLMs and agent frameworks (OpenAI, Anthropic, LangChain/LangGraph)
--Knowledge of DSPy or other prompt‑optimization / evaluation frameworks
--Background in red teaming, adversarial testing, or stress testing
--Experience generating synthetic data at scale
--Exposure to vector databases (pgvector, Qdrant) and RAG systems
--Familiarity with data pipelines and orchestration tools (dbt, Airflow, Apache Hop)
--Experience with Docker, observability, or ML tooling (OpenTelemetry, Weights & Biases)
Job Description
You will join a specialized "Modernization Strike Team" focused on rewriting how enterprise software is built, tested, and operated using AI. We expect you to be a "Full-Stack Agentic QA Engineer." You should be comfortable taking a complex business requirement and building an AI-driven verification workflow that ensures 100% accuracy. you are the guardian of reliability in an autonomous world. You aren't here to write manual test cases or maintain brittle Selenium scripts; you are here to build the QA Agents that explore, test, and verify our ERP while we sleep
Responsibilities Include:
--Build and evolve an autonomous “QA Swarm” that tests the ERP UI, APIs, and financial workflows
--Design AI agents that explore the product, find broken flows, and validate reconciliations
--Replace brittle, manual tests with self‑healing, AI‑maintained test suites
--Create Judge / Eval Agents to assess agent accuracy, hallucinations, and data safety
--Reverse‑engineer legacy business logic and wrap it in modern, automated tests
--Stress‑test systems using synthetic ERP data and adversarial scenarios
--Act as the final gatekeeper of production reliability for agent‑driven features
Are you looking for remote jobs near your area? At Yulys, thousands of employers are looking for exceptional talent like yours. Find a perfect job now.