
Closed
Posted
German Long Document Sourcing - (AI Training Project) Summary We are seeking detail-oriented freelancers to support a large-scale data sourcing project focused on training advanced AI systems. This project involves sourcing high-quality long-form documents in German across multiple domains and categories. Project Scope Total Documents Required: 140 Coverage: 17 domains and 140 fine-grained categories Requirement: 1 document per category Document Length: Minimum 40 pages, Maximum 100 pages Key Responsibilities Ensure all documents are real-world data only (no synthetic or AI-generated content), created within the last 10 years, and relevant to the assigned domain and category. Maintain high-quality structure, layout, and formatting, and strictly follow all provided sourcing guidelines. Mandatory Requirements No duplicate templates — each of the 140 documents must follow a unique structure/template. Documents must not be sourced from public benchmark datasets. Only genuine, real-world documents will be accepted. Compensation & Candidate Profile Each approved submission will be paid at a fixed rate of $40 per document. Candidates with familiarity in German document formats and structures are preferred. Prior experience in data sourcing, data entry, document annotation, or AI training datasets is a plus but not mandatory. Additional Information This is a recurring opportunity, with ongoing batches available based on the quality and consistency of submissions. Only guideline-compliant submissions will be approved.
Project ID: 40417111
1 proposal
Remote project
Active 21 secs ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs