
Closed
Posted
Paid on delivery
I am seeking high-quality GitHub codebases to train an LLM. The repositories should satisfy the following criteria:
1. 10k+ lines of code
2. 50+ commits
3. 5+ PRs
If you have such a codebase, let's discuss.
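The three thresholds above can be expressed as a simple filter. The following is a minimal Python sketch, not part of the original posting: the `RepoStats` record, its field names, and the helper functions are hypothetical, and it assumes the metrics have already been collected by some other tooling.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class RepoStats:
    """Hypothetical per-repository metrics (collected elsewhere)."""
    name: str
    lines_of_code: int
    commits: int
    pull_requests: int


# Thresholds taken from the posting: 10k+ LOC, 50+ commits, 5+ PRs.
MIN_LINES_OF_CODE = 10_000
MIN_COMMITS = 50
MIN_PULL_REQUESTS = 5


def meets_criteria(repo: RepoStats) -> bool:
    """Return True only if the repository satisfies all three thresholds."""
    return (
        repo.lines_of_code >= MIN_LINES_OF_CODE
        and repo.commits >= MIN_COMMITS
        and repo.pull_requests >= MIN_PULL_REQUESTS
    )


def filter_repos(repos: Iterable[RepoStats]) -> List[RepoStats]:
    """Keep only the repositories that pass meets_criteria, preserving order."""
    return [repo for repo in repos if meets_criteria(repo)]
```

Here "10k+" is read as "at least 10,000", so a repository sitting exactly on a threshold passes. A stricter reading would replace `>=` with `>`.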
Project ID: 40177639
22 bids
Remote project
Active 6 days ago
22 freelancers are bidding an average of ₹109,960 INR for this job

With a project as specific as yours, the need for not just large codebases but quality ones cannot be emphasized enough, and that's where CnELIndia shines. We have maintained an average line count of 15k+ across all our projects, which translates to dedicated experience in handling high-volume codebases such as those you require. But simply having voluminous code is not enough; there also needs to be methodical and flexible management, which my company has displayed with a record of over 50 commits from our developers on various projects. Our experience with data scraping complements your requirement to 'train LLM' perfectly. With Python as our tool of choice for scraping tasks, we can bring you the highest-quality repositories that also align with your project's objectives. As a bonus, you get to work with a Five-Star rated team of skilled eCommerce practitioners in diverse fields including Graphic Design, HTML, Software Development and more, who always complete projects within timeframes and budgetary constraints without compromising on quality. But don't merely take my word for it: our clients speak for us through their 743 satisfied testimonials. So if you're ready to make this project a resounding success, let's get talking!
₹112,500 INR in 25 days
9.1

With my diverse skill set of being a Full-Stack Developer and Biostatistician, I can offer you an exceptionally unique proposition. Not only have I developed various high-quality codebases exceeding the given criteria of your project but also have employed these skills to conduct analytical work for research purposes. In terms of advanced software development, I meet all your requirements rigorously. I am well-versed with numerous programming languages such as JavaScript (React, Angular), Python (Django, Flask), and C#. Alongside my swift adaptation to new technologies, I am proficient in handling cloud services like AWS and have strong expertise in implementing CI/CD strategies. All this allows me to efficiently manage not just the design and development aspect but also the backend architecture and API development, essential for large-scale codebases. On the Biostatistics side, my experience in conducting thorough statistical analysis for years adds another dimension to the value I can bring to your LLM training project. My command over R, SAS, SPSS, and Python makes me an asset for extracting crucial researcher-friendly information from massive datasets; not just for research but also for strategic decision-making for your dataset.
₹112,500 INR in 7 days
4.0

Hi, this is Jagrati. I’ve reviewed your requirements and understand you’re looking for high-quality, production-grade GitHub repositories suitable for LLM training, with clear signals of real-world development maturity.
I can provide and discuss codebases that meet your criteria:
• 10k+ lines of well-structured code
• 50+ meaningful commits with clean history
• 5+ pull requests reflecting collaborative development
• Production-ready patterns, modular architecture, and consistent coding standards
The repositories are built using modern stacks (primarily JavaScript/TypeScript, Node.js, and frontend frameworks), include real-world business logic, and demonstrate maintainable design rather than toy examples. Commit history reflects iterative development, refactors, bug fixes, and feature evolution, which is useful for training and evaluation purposes.
I’m happy to:
• Share repository overviews and metrics
• Walk through architecture and code quality
• Clarify licensing and usage constraints
• Discuss how the codebases align with your LLM training goals
If you’re open to a quick discussion, I can share examples and details right away. Looking forward to connecting.
Regards,
Jagrati
Full-Stack Developer | Production Codebases • Clean Architecture
₹112,500 INR in 7 days
1.0

THIS IS NOT THE AUTO BID, PLEASE REVIEW IN DETAIL Hi, I understand you’re looking for high-quality GitHub codebases suitable for LLM training, with clear criteria around code volume, commit history, and PR activity. I have access to and experience working with large, production-grade repositories that meet your requirements (10k+ LOC, 50+ commits, 5+ PRs), including well-structured code, meaningful commit histories, and real-world collaboration patterns. These are suitable for model training and analysis rather than toy or auto-generated projects. Happy to discuss available options, walk you through repository details, and ensure they align with your training goals before proceeding. Looking forward to connecting.
₹130,000 INR in 6 days
0.0

As a seasoned Full-Stack Developer and Product Management Professional, I am highly proficient in meeting the specific requirements you've listed for high-quality codebases. With my expertise in the MERN stack, Python, and SQL, among others, I have led and contributed to scalable software solutions and AI-powered applications, all of which require robust, well-documented codebases. In addition to meeting your technical needs, I also bring strategic leadership, agile delivery, and a deep understanding of product architecture into the mix. This blend enables me to design not just functioning product codebases but ones that are future-ready and user-centric too. Data management is a fundamental aspect of good coding practice, so my SQL and database management skills help ensure that optimized queries and resilient data models are used. By automating deployments with CI/CD and DevOps, I can also ensure that the codebase is future-proof and streamlined for potential updates. In summary, choosing me for this project means not only getting the high-quality codebases you need but also gaining a strategic partner who understands product success and end-to-end value delivery.
₹120,000 INR in 7 days
0.0

I can provide high-signal GitHub repositories tailored for your LLM training. I will strictly filter for 10k+ LOC, 50+ commits, and 5+ PRs, focusing on clean PHP, eCommerce, and UI/UX structures. As a developer, I don't just scrape data; I verify code quality to ensure your model learns robust logic, not noise. Timeline: I’ll complete this in 7 days, including manual verification and data cleaning. Ready to share a sample list immediately. Let’s discuss!
₹112,500 INR in 8 days
0.0

I've got a few repos that might fit the bill:
- TensorFlow (obviously huge, lots of commits/PRs)
- Django
- Kubernetes
- React
What makes these codebases stand out:
- Large codebase with diverse functionality
- Active communities driving development
- Well-maintained with thorough PR reviews
₹112,500 INR in 7 days
0.0

If you're looking for someone who can actually deliver results, not just promises, this proposal will be worth 60 seconds of your time. I can solve the exact problem you described (identifying high-quality GitHub codebases that meet your criteria of 10k+ lines of code, 50+ commits, and 5+ PRs), and I've done it before. Most freelancers will tell you they can do this. Very few will tell you how they'll make it work for your business. You don't need another freelancer; you need someone who understands your end goal. I specialize in researching and curating clean, professional, and user-friendly code repositories, ensuring a seamless and integrated approach to sourcing automated data that fits your training needs. While I am new to Freelancer, I have extensive experience and have completed other projects off-site. Let's move forward and turn this into a successful project. Regards, Nadia Du Preez
₹112,500 INR in 30 days
0.0

Hi, I’m interested in your project and confident I can deliver quality results quickly. Looking forward to working with you.
₹100,000 INR in 14 days
0.0

Pune, India
Member since Oct 8, 2023