Getting a handle on the blobs behind Unity Catalog
A Python and PySpark approach to analyzing managed table storage in Azure Blob
By Matt Adams
Often when working with Unity Catalog, especially with managed tables, the underlying blob storage structure can feel a bit opaque. Understanding storage consumption at the blob level is crucial for cost analysis, identifying unexpectedly large tables, tracking storage growth patterns, and debugging storage issues. This guide provides a practical approach...
[Read More]