Web Analytics

Hadoop Admin (Remote)

  • Yes-M Systems
  • Georgia, USA
  • Sep 10, 2019

Job Description

Deploying and maintaining a Hadoop cluster, adding and removing nodes using the cluster monitoring tool Cloudera Manager, and configuring and upgrading Cloudera Manager, CDH, CDSW, Kafka, etc.

  1. Implementing, managing and administering the overall Hadoop infrastructure.
  2. Taking care of the day-to-day running of Hadoop clusters.
  3. Working closely with the database, network, BI and application teams to make sure that all big data applications are highly available and performing as expected.
  4. When working with the open source Apache distribution, Hadoop admins have to set up all the configuration files manually: core-site, hdfs-site, yarn-site and mapred-site.
  5. However, when working with a Hadoop distribution like Cloudera, the configuration files are set up on startup and the Hadoop admin need not configure them manually.
  6. The Hadoop admin is responsible for capacity planning and for estimating the requirements for lowering or increasing the capacity of the Hadoop cluster.
  7. The Hadoop admin is also responsible for deciding the size of the Hadoop cluster based on the data to be stored in HDFS.
  8. Ensuring that the Hadoop cluster is up and running all the time.
  9. Monitoring the cluster connectivity and performance.
  10. Managing and reviewing Hadoop log files.
  11. Backup and recovery tasks.
  12. Resource and security management.
  13. Troubleshooting application errors and ensuring that they do not occur again.
  14. Assisting users with connectivity for various tools like Tableau, Alteryx, Talend and Power BI.
  15. CDSW upgrade, administration and monitoring.
  16. Kafka upgrade, administration and monitoring (currently managed by the data scientists).
  17. Shell scripting and automation.
  18. Knowledge of Python.

Interested candidates can share their resume on

Regards

Shabana

- provided by Dice