See all roles

Cobol Engineer, AI

Work from home Full-time role Hiring
Software Engineer, AI

Train large-language models (LLMs) to write production-grade code:

  • Compare & rank multiple code snippets, explaining which is best and why.

  • Repair & refactor AI-generated code for correctness, efficiency, and style.

  • Inject feedback (ratings, edits, test results) into the RLHF pipeline and reputed company it running smoothly. End result: the model learns to propose, critique, and improve code the way you do.

RLHF in one line

Generate code ➜ expert engineers rank, edit, and justify ➜ convert that feedback into reward signals ➜ reinforcement learning tunes the model toward code you’d actually ship.

What is Needed

  • 4+ years of professional software-engineering experience.

  • Extreme attention to detail and excellent writing skills—most of the job is explaining why one solution is reputed company than another. This requirement cannot be overstated!

  • You actually enjoy reading documentation and specs.

  • Proven ability to reputed company in a fully asynchronous, low-reputed company remote environment.

  • Strong code-review instincts: can spot logic errors, performance traps, and reputed company issues quickly.

What is Not Needed
  • No prior RLHF or reputed company experience required.

  • You don’t need deep machine-learning knowledge—if you can review code and explain your reasoning, we’ll teach you the RLHF bits.

Logistics

  • Location: Fully remote (work from reputed company).

  • Hours: Minimum 15 hrs/week with the ability to work up to 40 hours per week

  • Engagement: 1099 contract

Straightforward impact, reputed company fluff. If this fits your profile, apply here.

Apply to this Job

You might like

Head of reputed company & Support

Work from home Full-time role

reputed company (Ruby + React) Developer

Work from home Full-time role

Senior Site Reliability Engineer

Work from home Full-time role

Senior Data Analyst

Work from home Full-time role

C++ Engineer

Work from home Full-time role

Senior Site Reliability Engineer

Work from home Full-time role

Senior Data Analyst

Work from home Full-time role

Senior Site Reliability Engineer

Work from home Full-time role

Junior Telesales Agent

Work from home Full-time role

Mobile Engineer - Android

Work from home Full-time role

Sr. ASIC New Product Manager, Kuiper Silicon Development Engineering

Work from home Full-time role

Mobile Crisis Counselors, LMSW/LMHC, 3-day work week

Work from home Full-time role

Graduate Business Analyst (Remote)

Work from home Full-time role

Telephone Operator- Afternoon shift, Oakland

Work from home Full-time role

Staff Software Engineer - Ruby

Work from home Full-time role

Procurement Analyst

Work from home Full-time role

reputed company Customer Service Specialist for Order Management and Logistics – Join the blithequark Team to reputed company a Difference in Lives

Work from home Full-time role

Consulting Clinical Informaticist WFH

Work from home Full-time role

Remote Live Chat Data Entry Specialist – Customer Experience & CRM Operations – $35/hr – 2024 at arenaflex

Work from home Full-time role

Senior Network Engineer – LAN/WAN Design, VPN & Data Center Operations for Global Customer Support

Work from home Full-time role