Find Jobs
Hire Freelancers

Big Data project

€250-750 EUR

Fechado
Publicado há quase 3 anos

€250-750 EUR

Pago na entrega
Need: We are looking to form a team of two skilled developers that are willing to work on a big data project for the three different tasks that we will explain in a bit. Objective: The objective is to scale the platform for future needs as well as address limitations of retrieving data by building a mini search engine with Elasticsearch. Minimizing the cost is important factor which motivated the use of open-source tools such as: Apache NiFi Apache Spark Ellasandra (Cassandra + Elastic search) User stories: User story 1: As a user, I want to be able to import a heavy csv file to Cassandra. User story 2: As a user, I want to be able to search for a specific field (mail, id ...) and get the ten first occurrences displayed. User story 3: As a user I want to be able to export a csv file that has the result of the query I wrote in the search engine. Explanation of the role of the three project main functionalities: The user will have access to a simple graphical user interface that will let him choose between: - Import: has as objective to allow importing heavy datafiles - Export: has as objective to generate a text file with respect to some filter (could be a table name, or a property) specified and fetched from the search input field - Search: has as objective to filter the data and render at most 10 data rows that matches the search query and render it to the user. Note: This project generates only one view for the user based on the input in the search field. Expected features: 1) Simple graphic interface that contains the 3 main Sub functionalities of this interface: button import, button export (export can be a collection, depends on the filter and the query), Search field (to filter). • Level of priority: Medium • Expected programming skills: HTML / CSS / Flask(preferably but not a must) 2) Settle Elassandra cluster (2 databases). Load huge data file into the cluster in order to achieve distributed storage. And finally export the result of a specific query to a csv file. • Level of priority: High • Expected programming skills: python, Elasandra NoSQL (Cassandra + Elasticsearch), Apache NiFi 3)Server side for data processing: This will have two main and separate goals: First: If one of the special fields in already existent, upsert into already existing record the missing fields from the new record and vice-versa. Second: Implement The search algorithm that will get the expected row, table or even field • Level of priority: high • Expected programming skills: python, Apache Spark, Elasandra (Cassandra+Elasticsearch) Expected result: The project will be deemed successful if we see that the user stories are met, and the database fields are being updated as explained in the section before.
ID do Projeto: 30644748

Sobre o projeto

2 propostas
Projeto remoto
Ativo há 3 anos

Quer ganhar algum dinheiro?

Benefícios de ofertar no Freelancer

Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
2 freelancers estão ofertando em média €625 EUR for esse trabalho
Avatar do Usuário
Hi, I am an experienced Data Engineer with a solid background in Spark. I have worked on many Big Data projects with Spark, Scala, Python, Cassandra, Snowflake, AWS,... Let's have a call for more details about the project. Regards
€500 EUR em 7 dias
4,8 (2 avaliações)
4,4
4,4

Sobre o cliente

Bandeira do(a) FRANCE
SAINT DENIS, France
4,9
5
Método de pagamento verificado
Membro desde set. 18, 2016

Verificação do Cliente

Obrigado! Te enviamos um link por e-mail para que você possa reivindicar seu crédito gratuito.
Algo deu errado ao enviar seu e-mail. Por favor, tente novamente.
Usuários Registrados Total de Trabalhos Publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Carregando pré-visualização
Permissão concedida para Geolocalização.
Sua sessão expirou e você foi desconectado. Por favor, faça login novamente.