Crossing Hurdles · 10 hours ago
Application Developer | $80/hr Remote
Crossing Hurdles is seeking a Software Engineering & Systems Design Expert to evaluate LLM-generated responses and ensure their accuracy and quality. The role involves reviewing technical explanations, executing code, and providing structured feedback on model outputs.
Staffing & Recruiting
Responsibilities
Evaluate LLM-generated responses to software engineering and coding questions for accuracy, reasoning quality, and completeness
Review and validate technical explanations, system design discussions, and code solutions across varying complexity levels
Execute and test code to verify correctness, performance, and edge-case handling
Identify logical errors, inefficiencies, bugs, or misleading explanations in model-generated outputs
Annotate responses with structured feedback highlighting strengths, weaknesses, and factual inaccuracies
Assess code quality, readability, algorithmic soundness, and adherence to engineering best practices
Ensure AI outputs align with expected conversational behavior and system guidelines
Apply consistent evaluation standards using defined taxonomies, benchmarks, and review criteria
Qualification
Required
BS, MS, or PhD in Computer Science or a closely related technical field
Significant real-world experience in software engineering or systems design roles
Expertise in at least one major programming language (e.g., Python, Java, C++, JavaScript, Go, Rust)
Ability to independently solve medium to hard-level algorithmic problems
Experience contributing to open-source projects with merged pull requests
Strong familiarity with using LLMs for coding and understanding their strengths and limitations
Excellent attention to detail and ability to evaluate complex technical reasoning
Fluent in English with strong written communication skills
Company
Crossing Hurdles
At Crossing Hurdles, we specialise in customised recruitment and staffing solutions designed to drive success for businesses and professionals.
Funding
Current Stage
Early StageCompany data provided by crunchbase