Home:ALL Converter>Hadoop distcp to S3

Hadoop distcp to S3

Ask Time:2023-01-05T07:31:12         Author:Rajkumar

Json Formatter

I am using Hadoop distcp command to move data from hdfs to s3. Recently after hadoop cdh to cdp upgrade I am facing a difference in -update option. Previously-update will move files with same file name, same size but with different content. now it ignores the file if name and size are same. Is there anyways to move achieve this kind of update in cloudera cdp version?

hadoop distcp -pu -update -delete hdfspath s3bucket

Author:Rajkumar,eproduced under the CC 4.0 BY-SA copyright license with a link to the original source and this disclaimer.
Link to original article:https://stackoverflow.com/questions/75012441/hadoop-distcp-to-s3
yy