Data lakes have emerged as a special storage and repository mechanism of the raw unsoiled data in its native form. Organizations today generate enormous data related to various aspects of activity. In order to resolve the challenges of data storage, integration and accessibility, data lake is created. It allows refining, exploring and enriching the data as per the organizational requirements.