Thursday, January 28, 2016

spark solution please

Hi,
Please find this info.
Actual data file size: It's 6.4KB only


run this command

 

But getting 40409400 bytes it means 40.4094 MB
But actual size  Why ? can you help me on it please?

2 comments:

  1. Did u get a solution for this? I am also running into the same problem. Any inputs are highly appreciated.

    ReplyDelete
  2. sc.textFile(filepath) needs filepath as argument you are again providing a serialized dataframe as file path but not the correct original file path

    ReplyDelete