
Fechado
Publicado
Pago na entrega
I’m preparing a training corpus for a large-language-model experiment and need help sourcing mature, well-maintained private repositories on GitHub. I’m interested only in projects that clearly demonstrate active development and non-trivial code volume: • at least 10 000 lines of code • 50 plus commits in the main branch • a minimum of 5 merged pull requests You must be the owner of the repo Language and domain are flexible—Python, JavaScript, Java and beyond are all fine so long as the metrics above are met. What I need from you 1. A short list of candidate repositories, each with its GitHub URL and a brief note of the primary language. 2. A quick validation snapshot for every repo showing line-count, commit count, and pull-request count (a link to the GitHub insights page or a simple `cloc`/`git log` summary is enough). 3. Once I approve the shortlist, a zipped dump or Git clone of each repo (including history) so I can ingest it directly into my pipeline. Acceptance criteria • Every repo you deliver satisfies all three numeric thresholds. • The archive or clone is complete and uncorrupted. • Your validation notes match what I see on GitHub. If you already maintain or know of repositories that match, let’s talk—I can move quickly once the quality is confirmed.
ID do Projeto: 40180112
27 propostas
Projeto remoto
Ativo há 1 dia
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
27 freelancers estão ofertando em média ₹53.370 INR for esse trabalho

Hi, I am an IIT Grad, ex-BFSI and worked at fortune 500 companies. I will make it a reality for you. As a Project Manager, I will analyze public repositories on GitHub, utilizing keywords, stars, and commits to identify active projects meeting the specified criteria, then reach out to owners for permission. Kindly click on the chat button so we can discuss and get started. Will share you my prior projects done and my resume too. I have been doing freelancing since 2019 worked at top MNCs in both USA and India. Lets connect
₹37.500 INR em 7 dias
5,4
5,4

With your expansive search criteria for mature, actively maintained codebases on GitHub, my 7+ years of experience in diverse software domains makes me an ideal candidate to complete this task. Having worked extensively with Java, JavaScript, and Python, and a profound understanding of Software Architecture, I'm equipped to effortlessly meet the project requirements. Additionally, my fluent command of other programming languages such as Ruby on Rails, C++, and more give me an extra edge in searching and filtering high-quality repositories. In addition to my technical abilities, I emphasize client satisfaction by delivering top-tier outputs while adhering to deadlines. With this in mind, once we have the shortlist you approve of,I guarantee timely delivery in zipped dump or Git clone format for easy integration into your pipeline. To summarize, through my- in-depth knowledge of various languages and frameworks, proven experience in software development, and an established reputation for meeting client expectations,I am equipped to ensure that you receive the high-quality GitHub codebases you need for your experiment.
₹37.500 INR em 7 dias
6,4
6,4

Hello there, I reviewed your project High-Quality GitHub Codebases Wanted and understood the requirements at a high level. I focus on delivering clear, stable, and maintainable solutions aligned with the actual scope, I can work with Java, JavaScript, Python and follow a clean development process with proper structure and error handling. If this aligns with what you’re looking for, please come to chat to discuss further. Best regards
₹37.500 INR em 7 dias
4,4
4,4

Hello, I’m Rahul Singh from Team Velora, and Team Velora has been running for 3+ years managing mature, large-scale GitHub repositories. We understand your requirement for private repositories with ≥10,000 lines of code, 50+ commits, and 5+ merged PRs, along with complete validation snapshots for each. Our team can provide a shortlist of eligible repos with full history ready for your LLM training pipeline. Let’s come to the chat box to share the candidate repositories and validation details.
₹54.000 INR em 22 dias
4,3
4,3

Hello Tejas V., I went through your project description and it seems like that I am a great fit for this job. I have an expert team with many years of experience in Java, JavaScript, Python, Project Management, Software Architecture, Software Development, Git, Open Source, GitHub, Software Engineering. Lets connect in chat so that we discuss further. Thank You
₹56.300 INR em 7 dias
3,7
3,7

Hi there, I can help you source mature, well-maintained private GitHub repositories that meet your LLM training criteria. I have experience managing and curating large codebases and can quickly validate repositories against your thresholds: ≥10,000 lines of code, ≥50 commits, and ≥5 merged pull requests. Here’s how I’ll approach your project: Compile a shortlist of candidate repositories I own, including GitHub URLs and primary languages. Provide validation snapshots for each repo—line counts, commit counts, and pull-request counts—via cloc, git log, or GitHub insights links. Upon your approval, deliver complete zipped archives or Git clones including full history, ready for ingestion into your pipeline. I ensure all repos meet your numeric thresholds, are fully validated, and the archives are complete and uncorrupted. I’m prompt, meticulous, and familiar with multi-language codebases, making me well-suited to deliver high-quality material for your LLM experiment. I’m ready to get started immediately once you approve the approach and can provide sample validation notes for transparency.
₹56.250 INR em 7 dias
3,4
3,4

Hello, I understand this as a curation and verification task for high-quality, production-grade GitHub repositories intended for LLM training rather than a simple repo search. I would approach this systematically: define a diverse but relevant candidate pool → screen repositories against hard quantitative thresholds (LOC, commits, merged PRs) → manually verify activity and code maturity → document validation evidence with GitHub insights or git log summaries → prepare clean, complete Git clones including full history. The final delivery will be a vetted shortlist with clear metrics you can independently verify, followed by correctly archived repositories ready for direct ingestion into your pipeline, with zero ambiguity on quality or completeness.
₹50.000 INR em 7 dias
2,9
2,9

Hello, I’ve carefully reviewed your project requirements and clearly understand the tasks involved. I have 13 years of experience and strong expertise in the exact skills this project requires. I have successfully delivered similar projects before and can share relevant samples if needed. I will complete this within your expected timeline while maintaining quality and clear communication. I look forward to working with you and contributing sincerely to your project’s success.
₹56.250 INR em 7 dias
3,1
3,1

I have strong experience sourcing and validating high-quality open-source GitHub repositories for ML and LLM training. I can shortlist mature projects that meet your exact thresholds, provide clear validation snapshots (LOC, commits, PRs), and deliver complete, verified clones with full history ready for ingestion.
₹56.250 INR em 7 dias
2,8
2,8

With over 9 years of experience as a lead developer and a deep understanding of various programming languages including JavaScript, I am well-positioned to help you find the high-quality GitHub repositories you need. Though my proficiency extends to Mobile App Development, Java, PHP, HTML/CSS and more, my competency is about helping clients turn their IDEAS TO REALTY –- just as you envision for this project. My previous experience in web and Mobile App development has equipped me with the technical acumen required for this gig – including comprehensive knowledge of critical metrics such as line count, commit count, and pull-request count. I am particularly adept at searching and validating GitHub repositories for clients' specific requirements. In addition to my fitting skill set and extensive experience, I offer effective cost management and beneficial post-delivery support - a complimentary 3-month assistance that ensures your complete satisfaction. Rest assured that I’ll not only deliver repositories which satisfy all of your specified numerical thresholds, but also that the archive or clone will be fully integrated into your pipeline without any corruption. Thank you for considering my bid!
₹56.250 INR em 7 dias
2,8
2,8

I can provide a vetted shortlist of my own actively maintained GitHub repositories that meet your exact thresholds for code volume, commits, and merged pull requests. I’ll include clear validation snapshots for each repo and deliver clean, complete clones with full history once approved. Muzammil
₹37.500 INR em 1 dia
1,6
1,6

Dear Client, I can provide owner-maintained, mature GitHub repositories that meet all your criteria (10k+ LOC, 50+ commits on main, 5+ merged PRs). I maintain and have access to multiple actively developed private repos across Python, JavaScript, and Java, suitable for LLM training use cases. I’ll deliver: A vetted shortlist with GitHub URLs and primary language Clear validation snapshots (cloc, git log, PR history / Insights links) Clean Git clones or zipped archives with full history, ready for ingestion All repositories will strictly meet the numeric thresholds, with validation matching GitHub exactly. I can move fast once you review the shortlist. Best regards, WiredAI Venture
₹45.000 INR em 7 dias
1,4
1,4

✔ I deliver 100% work — 99.9% is not for me. ✔ Workflow Diagram Repository Sourcing ⟶⟶ Code Volume & Activity Validation ⟶⟶ Shortlist Approval ⟶⟶ Git Clone / Archive Delivery ⟶⟶ Pipeline Ingestion Support Key Highlights ✔ Curated open-source repositories — mature, actively maintained, and non-trivial code volume. ✔ Quantitative validation — line counts, commit counts, and merged pull request metrics verified for each repo. ✔ Multi-language flexibility — Python, JavaScript, Java, and others supported. ✔ Documentation included — GitHub URL, primary language, and snapshot of repo health for every candidate. ✔ Full repository delivery — zipped archives or git clones including complete commit history for ingestion. ✔ Transparent verification — validation notes match GitHub insights so you can confirm independently. ✔ Rapid iteration — shortlist can be approved quickly, and delivery follows immediately. Best Regards, Hamza Open-Source Curator | Git & Repository Management | LLM Training Pipeline Support
₹40.000 INR em 30 dias
0,0
0,0

Hello, I can help you source mature, actively maintained open-source GitHub repositories that meet your exact numeric thresholds and are suitable for LLM training. I’ll take a verification-first approach to ensure every candidate fully satisfies: ≥10,000 lines of code ≥50 commits on the main branch ≥5 merged pull requests What you’ll receive: A curated shortlist of repositories with GitHub URLs and primary language A clear validation snapshot for each repo (GitHub Insights links and/or cloc + git log summaries) After approval, a complete Git clone or zipped archive (including history), verified and uncorrupted I’m comfortable working across Python, JavaScript, Java, and other ecosystems, and I double-check metrics before submission so there’s no mismatch during acceptance. I can move quickly once we align on the shortlist size and preferred languages. Looking forward to collaborating. Best regards, Muhammad Aqib Ali
₹50.250 INR em 3 dias
0,0
0,0

Dear Client, Good afternoon . How are you? I hope this proposal finds you well. I'M A CERTIFIED & EXPERIENCED EXPERT This is to inform you that I have KEENLY gone through your project description, CLEARLY understood all the project requirements as instructed in your project proposal and this is to let you know that I will perfectly deliver as desired. Being in possession of all stated required skills, (Software Development, Python and JavaScript), as this is my field of professional specialization having completed all certifications and developed adequate experience in the respective field, I hereby humbly request you to consider my bid for professional, quality and affordable services that meet all your requirements. I always guarantee timely delivery and unlimited revisions where necessary hence you are assured of utmost satisfaction when working with me. Please send me a message so that we can discuss more and seal the project. THANK-YOU & WELCOME.
₹75.000 INR em 1 dia
0,0
0,0

❤️❤️❤️Timeline:1day | Full-time availability in your time zone❤️❤️❤️ ⭐ If you award me, your smile shows up ⭐ I can source and validate mature, owner-maintained GitHub repositories that meet your exact LLM training criteria. +Deliverables Shortlist of qualifying repos (10k+ LOC, 50+ commits, 5+ merged PRs) Validation proof for each (GitHub Insights / cloc / git logs) Full Git clone or zipped repo (with history) after approval +Why me Strong Python & JavaScript background Experience auditing large, active codebases Fast, accurate delivery +Timeline Shortlist in 1–2 days Final delivery same day after approval
₹56.250 INR em 1 dia
0,0
0,0

Hello, We’re Resonite Technologies, a software engineering team with experience in sourcing, auditing, and delivering high-quality GitHub repositories for training corpora and research projects. We can help you compile a list of private repositories meeting your strict criteria. Our Approach: • Identify candidate repos with: – ≥10,000 lines of code – ≥50 commits on the main branch – ≥5 merged pull requests • Validate each repo using cloc, git log, and GitHub insights • Provide a short list with: – GitHub URL – Primary language – Snapshot of line count, commits, and merged PRs • Upon approval, deliver zipped clones including full commit history Deliverables: • Verified repo list matching your thresholds • Validation snapshots for each repo • Complete, uncorrupted archives or Git clones ready for LLM ingestion Why Resonite Technologies: • Familiar with private/public repo management • Strong Git expertise and history analysis • Experience preparing curated datasets for AI/ML pipelines Timeline: 3–5 days for shortlist and validation, delivery ready immediately after approval. Best regards, Resonite Technologies
₹86.250 INR em 7 dias
0,0
0,0

Pune, India
Membro desde out. 8, 2023
₹75000-150000 INR
₹100-400 INR / hora
₹600-1500 INR
$750-1500 USD
$1500-3000 USD
₹12500-37500 INR
$750-1500 USD
₹600-1500 INR
$30-250 USD
₹1500-12500 INR
₹12500-37500 INR
$250-750 USD
$250-750 USD
₹12500-37500 INR
€8-500 EUR
$1500-3000 USD
₹12500-37500 INR
$30-250 USD
₹12500-37500 INR
$30-250 USD
₹1500-12500 INR
₹12500-37500 INR
$250-750 USD