Skip to content

Massively scalable, secure data lake functionality built on Azure Blob Storage

  • Pros

  • Built on Azure Storage
  • Good integration with Azure data services
  • Integration with Open Source platforms via HDFS
  • Hierarchical namespace
  • Optimized for big data & big compute
  • POSIX permissions
  • ABFS optimized driver
  • Multi-protocol SDK
  • Cons

  • Throughput limits apply
  • Not all blob storage features yet supported

Read our blog posts about Azure Data Lake Storage

Implementing the OpenChain Specification

Implementing the OpenChain Specification

Charlotte Gayton

After a year of working on implementing the OpenChain specification, this blog takes you through the processes we created to track and manage our open-source licenses