2. Ingesting the extracted data (CSV, XML, JSON, fixed-width, etc.) into a Hadoop cluster.
3. Using MapReduce and Pig to transform the data and load it into an MPP system.
4. Stitching all of the above into one process using schedulers.
","employmentType":["FULL_TIME","PART_TIME","CONTRACTOR","TEMPORARY","PER_DIEM"],"jobLocationType":"TELECOMMUTE","hiringOrganization":{"@type":"Organization","name":"Toogit","sameAs":"https://www.toogit.com/","logo":"https://www.toogit.com/images/toogit_logo_initial.png"},"identifier":{"@type":"PropertyValue","name":"Toogit","value":362642},"skills":["Apache Flume","Apache Hive","Apache Spark","Hadoop","Hbase"],"applicantLocationRequirements":[{"@type":"Country","name":"IN"},{"@type":"Country","name":"Canada"},{"@type":"Country","name":"USA"},{"@type":"Country","name":"Germany"},{"@type":"Country","name":"Pakistan"},{"@type":"Country","name":"Philippines"},{"@type":"Country","name":"Indonesia"},{"@type":"Country","name":"Sri Lanka"},{"@type":"Country","name":"Nigeria"},{"@type":"Country","name":"China"},{"@type":"Country","name":"Russia"},{"@type":"Country","name":"Bangladesh"}],"validThrough":"2024-09-14T22:24:19+05:30","url":"https://www.toogit.com/freelance-jobs/MzYyNjQy"}
Remote Network And System Administration Job In IT And Networking
Find more Network And System Administration remote jobs posted recently Worldwide
Work from Anywhere
Hourly Type: 40 hrs / week
Remote Job
Cost: $19.14
We are looking for a freelancer who has proven experience in Data Engineering projects.
The requirements we are looking for:
- Experience with Python
- Experience with Big Data tools (e.g., Hadoop, Cassandra, Kafka)
- Experience wi...
Hi,
I need a small task, less than one hour of work, done using PySpark.
It involves adding some Spark code to existing Python code.
I will share my use case with shortlisted candidates. I am looking to build out an advanced analytics and BI platform that is cost-effective yet viable (capable of working successfully) and scalable.
The current use case is to collect data from CRM...
I need to convert JSON, Avro, or other row-based files in S3 into the Parquet columnar format using an AWS service such as EMR or Glue.
I already have code that converts JSON to Parquet using Python, but the process is very manual, acco...