Python Developer for AI Prototype (LLM + State Comparison, Short Project)

Work from home Full-time role Hiring

Python Developer for AI Prototype (LLM + State Comparison, Short Project) ________________________________________ Description I’m looking for a developer to help build a lightweight AI prototype using OpenAI or Anthropic APIs. This is NOT a full product build. This is a focused prototype to test a specific idea. ________________________________________ Project Goal Build a simple Python-based system that: 1.Runs the same LLM task multiple times. 2. Captures outputs and any intermediate state (memory/logs). 3. Compares differences between runs. 4. Classifies differences into simple categories: o Stable o Boundary o Violation ________________________________________ What This Means Think:

Run the same prompt 5–10 times.
Log results.
Detect where outputs or stored data differ.
Label those differences.

That is it. ________________________________________ Technical Requirements Must have:

Python
Experience with OpenAI API or Anthropic API
Ability to build simple, clean scripts (no over-engineering)

Nice to have:

LangChain or similar frameworks.
Streamlit (for simple UI/dashboard).
Experience with logging or comparing outputs.

________________________________________ Important Constraints This should be:

Lightweight.
fast to build.
easy to understand.

Please DO NOT:

Design complex architectures.
build full systems.
over-engineer.

________________________________________ Deliverables

Python script or small app.
Ability to run repeated LLM tasks.
Stored logs of runs (JSON or similar).
Basic comparison logic between runs.
Simple classification output.

________________________________________ Timeline

3–7 days initial build
Max 1–2 weeks total

________________________________________ Engagement Style

Fixed-price or hourly (open to discussion)
Will start with a small paid test task before full project

________________________________________ Screening Question (Required) Please answer this: If you needed to run the same LLM task multiple times and compare outputs/state between runs, how would you build it quickly? ________________________________________ Who This Is For Ideal candidate:

Builds fast prototypes.
Comfortable with LLM APIs.
Prefers simple solutions over complex systems.

________________________________________ Bonus If this goes well, there may be follow-on work. Apply tot his job Apply To this Job

Apply

Python Developer for AI Prototype (LLM + State Comparison, Short Project)

You might like

Python Developer (AWS) Europe

Python Developer with Java & Cloud - Southwest Airlines

Python Developer with Numpy Investment Banking

Staff Python Software Engineer

Sr. Python Developer/Team Lead

Python Developer (Risk Technology)

Middle/Senior Python Engineer

Lead Software Engineer - Python | Backend

Fresher Python Developer

Senior Python Developer, GenAI, Innovation Labs - VP

Specialty Business Manager, Derm - Morgantown, WV

Actuary II, Personal Auto Predictive Modeling (Hybrid or Remote) (Remote, MA, US

Senior Lead, Marketing Services Operations

Medical Records Support - Remote | German/English | Training Provided

Remote | Audit Workpaper & Risk Assessment Specialist - $65-$95/hour

Remote Sales Closers & Appointment Setters

CART - Captionist

Remote Data Entry Specialist – Work From Home Position | arenaflex Flexible Data Entry Careers

Experienced Customer Service and Sales Representative – Work from Home Opportunity at arenaflex

Experienced Customer Sales Representative – Remote Opportunity at arenaflex