Python Developer for AI Prototype (LLM + State Comparison, Short Project)
Python Developer for AI Prototype (LLM + State Comparison, Short Project) ________________________________________ Description I’m looking for a developer to help build a lightweight AI prototype using OpenAI or Anthropic APIs. This is NOT a full product build. This is a focused prototype to test a specific idea. ________________________________________ Project Goal Build a simple Python-based system that: 1.Runs the same LLM task multiple times. 2. Captures outputs and any intermediate state (memory/logs). 3. Compares differences between runs. 4. Classifies differences into simple categories: o Stable o Boundary o Violation ________________________________________ What This Means Think:
- Run the same prompt 5–10 times.
- Log results.
- Detect where outputs or stored data differ.
- Label those differences.
That is it. ________________________________________ Technical Requirements Must have:
- Python
- Experience with OpenAI API or Anthropic API
- Ability to build simple, clean scripts (no over-engineering)
Nice to have:
- LangChain or similar frameworks.
- Streamlit (for simple UI/dashboard).
- Experience with logging or comparing outputs.
________________________________________ Important Constraints This should be:
- Lightweight.
- fast to build.
- easy to understand.
Please DO NOT:
- Design complex architectures.
- build full systems.
- over-engineer.
________________________________________ Deliverables
- Python script or small app.
- Ability to run repeated LLM tasks.
- Stored logs of runs (JSON or similar).
- Basic comparison logic between runs.
- Simple classification output.
________________________________________ Timeline
- 3–7 days initial build
- Max 1–2 weeks total
________________________________________ Engagement Style
- Fixed-price or hourly (open to discussion)
- Will start with a small paid test task before full project
________________________________________ Screening Question (Required) Please answer this: If you needed to run the same LLM task multiple times and compare outputs/state between runs, how would you build it quickly? ________________________________________ Who This Is For Ideal candidate:
- Builds fast prototypes.
- Comfortable with LLM APIs.
- Prefers simple solutions over complex systems.
________________________________________ Bonus If this goes well, there may be follow-on work. Apply tot his job Apply To this Job