
Closed
Posted
I have a corpus of Bangla text that must be tagged for part-of-speech according to a guideline I will share after we start. The material arrives in small batches—usually a CSV with an “id” and “sentence” column—and I need each sentence tagged with the exact label set defined in my document. Accuracy and consistency are critical; I will run periodic spot checks and measure inter-annotator agreement. Feel free to work with any tools you prefer—TagAnt, CoNLL editors, custom Python scripts, or even manual spreadsheet work—as long as the final file is returned in the same order and in UTF-8. If you already have experience annotating Bangla or other Indic languages, let me know; it will help me schedule the review process. Deliverables for every batch: • Annotated file (CSV or CoNLL) matching my template • Short log of uncertain or edge-case decisions I will provide the full guideline, examples, and a small trial set once we agree on timing. Let’s get started on the first batch right away.
Project ID: 40417548
10 proposals
Remote project
Active 14 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
10 freelancers are bidding on average ₹870 INR/hour for this job

Your inter-annotator agreement metric will collapse if your taggers interpret compound verbs and postpositions inconsistently. Bangla morphology creates ambiguity between noun-verb boundaries that breaks POS models when annotators guess instead of following strict decision trees. Before I commit to batch processing, I need clarity on two things: What's your target Kappa score for agreement checks, and does your guideline cover sandhi resolution rules for merged tokens? If you're measuring IAA but the spec doesn't address ক্ষ্ণ-type conjuncts or light verb constructions, you'll get 70% agreement at best and waste review cycles. Here's the execution plan: - BANGLA LINGUISTICS: Apply morphological parsing rules to handle verbal noun ambiguity and ensure postposition tagging matches your label hierarchy, not generic UD standards. - PYTHON AUTOMATION: Build a validation script that flags inconsistent tags within each batch before delivery—catches tense marker errors and reduces your spot-check load by 60%. - DATA PROCESSING: Preserve UTF-8 encoding and sentence order while handling zero-width joiners that break naive CSV parsers, so your pipeline doesn't corrupt diacritics. - EDGE-CASE LOGGING: Document every ambiguous token with linguistic justification (not just "unsure"), creating a decision database that improves consistency across future batches. - NLP TOOLING: If you're training a model downstream, I'll structure annotations in CoNLL-U format with dependency hints so your parser doesn't choke on non-projective trees. I've annotated 40K tokens across Hindi and Bengali corpora for two academic NLP projects, hitting 0.89 Kappa on morphologically rich text. I don't take annotation work where the guideline has gaps—let's review your spec and trial set in a 20-minute call before I process the first batch. That way we catch interpretation conflicts now instead of after you've paid for 10 batches of inconsistent tags.
₹900 INR in 30 days
6.2
6.2

Hello, I can help with your Bangla POS tagging project. When you share the guideline and a sample batch I will annotate each sentence according to your exact tagset, keeping the original order and UTF 8 format. I can deliver the output as CSV or CoNLL as you prefer. Each batch i will include this Clean annotated file Short notes on any unclear or edge cases I am ready to start with the first trial batch whenever you send it. Thanks Mamun
₹750 INR in 40 days
5.7
5.7

Hi, I would like to grab this opportunity and will work till you get 100% satisfied with my work. I just applied after read your job posting carefully and I believe that I am good fit to your project. I'm a serious bidder. I will satisfy you with my high skills! I am an expert which have 10+ years of experience on Mobile App Development, iPhone, User Interface / IA, PhoneGap, Mobile App Testing, Swift, Google Maps API, Web Development, iOS Development, Mobile Development I am looking forward to meet you to discuss the further detail about this project. Looking forward to hearing from you. Warm Regards
₹750 INR in 40 days
5.1
5.1

Hi,I’m a seasoned Applied ML Engineer (6+ yoe)with experience in NLP annotation,text cleaning,POS/NER-style tagging,& multilingual/Indic-language data pipelines Proposed Workflow: I will implement a unified annotation process for Bangla POS tagging to ensure consistency across all batches. >>Guideline Alignment: Comprehensive review of your tagset & trial batches to master rules for compound words,particles,postpositions,& entities. >>Data Integrity: All processing in UTF-8 using a hybrid manual/Python workflow to preserve original IDs & sentence order while ensuring valid CoNLL/CSV formatting. >>Linguistic Precision: Maintenance of an Uncertainty Log to track edge cases like reduplication,multi-word expressions,& ambiguous tokens. Quality Assurance: Every delivery will pass internal validation checks: >>Verification of valid labels & zero missing tokens >>UTF-8 safety & structural preservation >>Cross-batch consistency for repeated tokens >>Documentation of all borderline or interpreted cases Relevant Experience: >>Linguistic Processing:Expert in multilingual text preprocessing,Unicode normalization & tokenization for Indic-language workflows >>NLP Tasks:Proven experience in POS/NER-style annotation,entity extraction & semantic classification >>Quality Assurance:Developed high-consistency dataset preparation pipelines where annotation integrity & script-specific nuances are critical
₹750 INR in 40 days
4.3
4.3

The fastest path to getting your Bangla text accurately tagged is through a clear, meticulous process. I understand you need each sentence labeled precisely according to your guidelines, and consistency is non-negotiable. I can definitely help with that. With direct experience in annotating various texts, including Bangla, I'm comfortable using the tools you mentioned,be it TagAnt or custom scripts,ensuring the output meets your specified format. I’ll also maintain detailed logs for any uncertainties, which should make your spot checks smoother. Just a thought: have you considered implementing a quick review of a small subset of the annotated text at first? It could streamline the feedback loop. I’m ready to dive in and can deliver the initial batch within 10 days. Can we hop on a 10-minute call this week?
₹800 INR in 40 days
0.0
0.0

Hi, As a native Bangla speaker with a strong understanding of grammar and linguistic structure, I can accurately perform part-of-speech (POS) tagging for your corpus with high consistency and attention to detail. I am comfortable working with CSV/CoNLL formats and can use tools like spreadsheets or custom workflows to ensure precise annotation following your guideline. I understand the importance of maintaining strict label consistency and preserving the original sentence order in UTF-8 format. For each batch, I will: • Carefully tag each token based on your defined label set • Maintain consistency across all sentences • Provide a short log highlighting uncertain or edge cases for clarity • Perform self-review checks to improve annotation quality and agreement Being fluent in Bangla, I can better interpret context, ambiguity, and grammatical nuances, which is essential for high-quality POS tagging. I am ready to start with the trial set immediately and can handle regular batches reliably within your timeline. Looking forward to working with you on this project.
₹750 INR in 40 days
0.0
0.0

Hello, I have 5+ years of experience working with data processing and NLP-related tasks, including text annotation and structured dataset handling. I understand the importance of accuracy, consistency, and strict adherence to tagging guidelines, especially for linguistic projects like POS tagging. I will follow your annotation standards precisely, maintain consistency across batches, and ensure clean, well-structured outputs in the required format. I can also provide logs for edge cases to support quality checks and improve agreement. I focus on detail, reliability, and timely delivery. Let’s connect and start with the first batch.
₹1,000 INR in 40 days
0.0
0.0

Bengaluru, India
Member since Aug 13, 2025
₹100-400 INR / hour
min ₹2500 INR / hour
$30-250 USD
₹12500-37500 INR
₹12500-37500 INR
$30-250 USD
$15-25 USD / hour
$30-250 USD
€6-12 EUR / hour
$10-30 USD
₹37500-75000 INR
$45 USD
£20-250 GBP
₹750-1250 INR / hour
$250-750 USD
₹750-1250 INR / hour
$45 USD
$250-750 USD
$3-10 USD / hour
₹1500-12500 INR
₹4000-6000 INR
₹150000-250000 INR
$15-25 AUD / hour
₹400-750 INR / hour
₹12500-37500 INR