Robin Lindgren - AI Tutor — STEM Prompting & Review

Q: Where is Robin based?

Robin is based in Stockholm, Sweden, and prefers 100% remote projects.

Q: What languages does Robin speak?

Robin speaks the following languages: Swedish (Native), English (Advanced).

Q: How many years of experience does Robin have?

Robin has at least 4 years of experience. During this time, Robin has worked in at least 5 different roles and for 5 different companies. The average length of individual experience is 1 year and 10 months. Note that Robin may not have shared all experience and may actually have more.

Q: What is Robin's latest experience?

Robin's most recent position is AI Tutor — STEM Prompting & Review at Mindrift.

Q: What companies has Robin worked for in recent years?

In recent years, Robin has worked for Mindrift, RWS (TrainAI), Outlier AI, TransPerfect / DataForce, and DataAnnotation.tech.

Q: Which industries is Robin most experienced in?

Robin is most experienced in industries like Information Technology (IT), Professional Services, and Education.

Q: Which business areas is Robin most experienced in?

Robin is most experienced in business areas like Quality Assurance (QA), Research and Development (R&D), and Marketing.

Q: What is Robin's education?

Robin attended Stensund Folkhögskola for the Behavioral Science Program.

Stockholm, Sweden

Experience

Jan 2025 - Present

1 year 2 months

AI Tutor — STEM Prompting & Review

Mindrift

Authored prompts and evaluation rubrics for math/physics/CS; required clear step-by-step solutions, unit handling, and error-explanation notes.
Reviewed model outputs for correctness and reasoning quality; created calibration sets and difficulty bands.

Jan 2025 - Present

1 year 2 months

AI Data Annotator — Multimodal Labeling & QA

RWS (TrainAI)

Labeled text/image data to evolving schemas; stabilized label taxonomies; maintained rationale notes and edge-case logs.

Jan 2024 - Present

2 years 2 months

AI Linguistic Trainer (Swedish/English) — Criteria & Evaluation Design

Outlier AI

Built scoring criteria for helpfulness, safety, factuality, and style; turned policy into example-led guidelines.
Wrote adversarial prompts to probe ambiguity, safety, and factual precision; tuned thresholds to raise rater agreement.

Jan 2023 - Present

3 years 2 months

Content Creator — Swedish (Chaya Project)

TransPerfect / DataForce

Produced 1,000+ Swedish SMS/email samples; enforced strict style/safety guidelines.
Result: 98.85% QA pass across 815 reviewed tasks.

Jan 2022 - Present

4 years 2 months

Swedish AI Linguist & Trainer

DataAnnotation.tech

Reviewed/refined model outputs across registers; designed task-specific rubrics; led small calibration passes.

Industries Experience

Experienced in Information Technology (4 years), Professional Services (3 years), and Education (1 year).

Business Areas Experience

Experienced in Quality Assurance (4 years), Research and Development (4 years), and Marketing (3 years).

Summary

LLM Trainer & Reasoning Specialist with 3+ years shaping high-fidelity prompts, evaluation rubrics, and gold-standard benchmarks across science/technology, legal principles, health & lifestyle, and multilingual use cases. I translate complex, policy-heavy instructions into clear, auditable workflows—including rationale notes, decision trees, inter-rater calibration sets, and defect taxonomies—that increase agreement, reduce rework, and raise throughput.

Proven record of quality at scale: 98.85% audited QA pass across 815 reviewed tasks, consistent SLA delivery in fully remote, fast-iteration environments. Strengths include reasoning-first prompt design (tiered variants, constraints, uncertainty language), evaluation operations (analytic/holistic rubrics, partial-credit logic, severity tagging, AQL spot checks), and safety/factuality governance (bias/fairness screens, non-advice framing, evidence-bounded prompts).

I partner closely with research and evaluation leads to convert model error analyses into derived prompts, adversarial test sets, clearer acceptance criteria, and versioned SOPs with complete audit trails. Tool-fluent with enterprise annotation and QA platforms; meticulous about metadata hygiene, template reuse, and documentation that scales across raters and projects.

Skills

Prompt Development (Reasoning-first): Craft Structured Prompts That Elicit Step-wise Reasoning; Write Tiered Variants (Basic → Advanced) With Clarity, Context, And Explicit Constraints.
Breadth Across Domains: Physics, Cybersecurity, Fitness, Automotive, Environment, Legal Principles; Plus Self-help/advice, Animals, ACG, Art, Books/reading.
Evaluation & Research Collaboration: Build Gold Standards, Define Scoring Rubrics (Holistic + Analytic), Run Inter-rater Calibration, And Iterate From Model Error Analyses.
Quality & Safety: Fact-checking Workflows, Citation Prompts, Boundary/adversarial Cases, Bias/safety Screens, And Minimal-edit Corrections.
Documentation: Versioned SOPs, Rationale Logs, Error Taxonomies, Reproducible Templates For Prompt Suites And Benchmark Sets.
Physics (Mechanics & EM): Concept-scaffold Prompts (Givens → Unknowns → Formula Selection → Unit Consistency); Distractor-aware MC Variants; Free-response With Rubric Describing Partial Credit And Common Slips (Sign Errors, Vector Decomposition, Sig-figs).
Cybersecurity: Threat-model Prompts (STRIDE/MITRE Mapping), Log-snippet Analysis (IOC Extraction), Secure Defaults/least-privilege Checklists, And Incident Synopsis Prompts With Evidence Tags And Scope Limits.
Fitness & Health: Evidence-bounded Prompts That Require Citing Guideline Sources; Progressive Overload Planning; Contraindication Checks And Scope-of-practice Guardrails (Non-diagnostic Framing).
Automotive: Troubleshooting Prompts Using Symptom Trees; OBD-II Code Reasoning; Maintenance Schedules By Climate/load Profile; Safety Caveats.
Environment: Lifecycle Analysis Frames; Emissions Trade-off Calculators; Policy-vs-practice Prompts With Regional Variability And Uncertainty Notes.
Legal Principles (Non-advice): Issue-rule-application-conclusion (IRAC) Skeletons For General Principles; Jurisdiction Flags; Cautionary “Not Legal Advice” Scaffolds; Case-fact Abstraction Prompts.
Built Benchmark Sets With Clear Acceptance Criteria, Partial-credit Rules, And Failure Modes; Logged Disagreements And Resolution Notes.
Ran Inter-rater Calibration With Seed Sets; Tracked Agreement Metrics; Iterated Instructions To Reduce Variance.
Partnered With Research Leads To Turn Model Misses Into Derived Prompt Variants (Counterfactuals, Negations, Distractor Density Controls).
Tools & Methods: OpenAI/Anthropic/Gemini Prompting; JSON/CSV Datasets; Google Sheets/Docs For Rubrics; Labelbox/Prodigy/CVAT; Light Python For Validation; Markdown Templates; Fact-check Workflows.

Languages

Swedish

Native

English

Advanced

Education

Stensund Folkhögskola

Graduated with Distinction, Top 5% · Behavioral Science Program · Trosa, Sweden

Profile

Created

September 2025

Last Update

October 2025

Frequently asked questions

What roles would Robin be best suited for?

Based on recent experience, Robin would be well-suited for roles such as: AI Tutor — STEM Prompting & Review, AI Data Annotator — Multimodal Labeling & QA, AI Linguistic Trainer (Swedish/English) — Criteria & Evaluation Design.

What is Robin's availability?

Robin is immediately available for suitable projects.

What is Robin's rate?

Robin's rate depends on the specific project requirements. Please use the Meet button on the profile to schedule a meeting and discuss the details.