Wednesday, May 16, 2018

Enable HDFS Short Circuit Reads

short-circuit reads bypass the DataNode, allowing a client to read the file directly, as long as the client is co-located with the data.

Add Below properties in hdfs-site.xml.


    dfs.client.read.shortcircuit  - true

    dfs.client.read.shortcircuit.streams.cache.size -  1000

    dfs.client.read.shortcircuit.streams.cache.expiry.ms -  10000

    dfs.domain.socket.path -   /var/run/hadoop-hdfs/dn._PORT

No comments: