Manage Azure Data Lake Storage assets
Operations on ADLS assets (connections, accounts, containers, objects).
In general, these should be:
- Created in top-down order (connection, then account, then container, then object)
- Deleted in bottom-up order (objects, then containers, then accounts, then connections) [1]
erDiagram
Connection ||--o{ ADLSAccount : contains
ADLSAccount ||--o{ ADLSContainer : contains
ADLSContainer ||--o{ ADLSObject : contains
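The ordering rule above can be sketched as a small helper that sorts asset type names for creation or deletion. This is only an illustration: the `HIERARCHY` list and both function names are hypothetical, and only the type names themselves come from this page.

```python
from typing import List

# Top-down hierarchy taken from the diagram above.
HIERARCHY = ["Connection", "ADLSAccount", "ADLSContainer", "ADLSObject"]


def creation_order(type_names: List[str]) -> List[str]:
    """Sort type names so parents come before their children."""
    return sorted(type_names, key=HIERARCHY.index)


def deletion_order(type_names: List[str]) -> List[str]:
    """Sort type names so children come before their parents."""
    return sorted(type_names, key=HIERARCHY.index, reverse=True)
```

A batch job could group its pending assets by type and then walk the groups in the order these helpers return.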
Asset structure
Connection
An Azure Data Lake Storage connection requires a `name` and `qualifiedName`. For creation, specific settings are also required to distinguish it as an Azure Data Lake Storage connection rather than another type of connection. In addition, at least one of `adminRoles`, `adminGroups`, or `adminUsers` must be provided.
Create an ADLS connection (Java)
- Retrieve the GUID for the admin role, to use later for defining the roles that can administer the connection.
- Build up the minimum request to create a connection.
- Provide a human-readable name for your connection, such as `production` or `development`.
- Set the type of connection to ADLS.
- List the workspace roles that should be able to administer the connection (or null if none). All users with that workspace role (current and future) will be administrators of the connection. Note that the values here need to be the GUID(s) of the workspace role(s). At least one of `adminRoles`, `adminGroups`, or `adminUsers` must be provided.
- List the group names that can administer this connection (or null if none). All users within that group (current and future) will be administrators of the connection. Note that the values here are the name(s) of the group(s). At least one of `adminRoles`, `adminGroups`, or `adminUsers` must be provided.
- List the user names that can administer this connection (or null if none). Note that the values here are the username(s) of the user(s). At least one of `adminRoles`, `adminGroups`, or `adminUsers` must be provided.
- Actually call Atlan to create the connection.
- Retrieve the `qualifiedName` for use in subsequent creation calls. (You'd probably want to do some null checking first.)
Create an ADLS connection (Python)
- Retrieve the GUID for the admin role, to use later for defining the roles that can administer the connection.
- Build up the minimum request to create a connection.
- Provide a human-readable name for your connection, such as `production` or `development`.
- Set the type of connection to ADLS.
- List the workspace roles that should be able to administer the connection (or `None` if none). All users with that workspace role (current and future) will be administrators of the connection. Note that the values here need to be the GUID(s) of the workspace role(s). At least one of `admin_roles`, `admin_groups`, or `admin_users` must be provided.
- List the group names that can administer this connection (or `None` if none). All users within that group (current and future) will be administrators of the connection. Note that the values here are the name(s) of the group(s). At least one of `admin_roles`, `admin_groups`, or `admin_users` must be provided.
- List the user names that can administer this connection (or `None` if none). Note that the values here are the username(s) of the user(s). At least one of `admin_roles`, `admin_groups`, or `admin_users` must be provided.
- Actually call Atlan to create the connection.
- Retrieve the `qualified_name` for use in subsequent creation calls. (You'd probably want to check for `None` first.)
POST /api/meta/entity/bulk
- The `typeName` must be exactly `Connection`.
- Human-readable name for your connection, such as `production` or `development`.
- The `connectorName` must be exactly `adls`.
- The `qualifiedName` should follow the pattern: `default/adls/<epoch>`, where `<epoch>` is the time in milliseconds at which the connection is being created.
- The `category` must be `ObjectStore`.
- List any workspace roles that can administer this connection. All users with that workspace role (current and future) will be administrators of the connection. Note that the values here need to be the GUID(s) of the workspace role(s). At least one of `adminRoles`, `adminGroups`, or `adminUsers` must be provided.
- List any groups that can administer this connection. All users within that group (current and future) will be administrators of the connection. Note that the values here are the name(s) of the group(s). At least one of `adminRoles`, `adminGroups`, or `adminUsers` must be provided.
- List any users that can administer this connection. Note that the values here are the username(s) of the user(s). At least one of `adminRoles`, `adminGroups`, or `adminUsers` must be provided.
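As an illustration of the REST annotations above, the following sketch assembles a request body in Python using only the standard library. The attribute names and required values come from the annotations. The outer `entities` list with a nested `attributes` object is an assumption about the payload shape, and the function name is hypothetical.

```python
import time
from typing import Any, Dict, List, Optional


def build_adls_connection_payload(
    name: str,
    admin_roles: Optional[List[str]] = None,
    admin_groups: Optional[List[str]] = None,
    admin_users: Optional[List[str]] = None,
    epoch_ms: Optional[int] = None,
) -> Dict[str, Any]:
    """Build a request body for creating one ADLS connection."""
    # At least one kind of administrator must be provided.
    if not (admin_roles or admin_groups or admin_users):
        raise ValueError(
            "Provide at least one of adminRoles, adminGroups, or adminUsers"
        )
    # <epoch> is the creation time in milliseconds.
    if epoch_ms is None:
        epoch_ms = int(time.time() * 1000)
    return {
        # Assumed envelope: a list of entities, each with typeName + attributes.
        "entities": [
            {
                "typeName": "Connection",
                "attributes": {
                    "name": name,
                    "connectorName": "adls",
                    "qualifiedName": f"default/adls/{epoch_ms}",
                    "category": "ObjectStore",
                    "adminRoles": admin_roles or [],    # workspace role GUIDs
                    "adminGroups": admin_groups or [],  # group names
                    "adminUsers": admin_users or [],    # usernames
                },
            }
        ]
    }
```

The resulting dictionary could be serialized with `json.dumps` and sent as the body of `POST /api/meta/entity/bulk`. Passing `epoch_ms` explicitly keeps the `qualifiedName` reproducible.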
Access policies
Atlan creates the policies that grant access to a connection, including the ability to retrieve the connection and to create assets within it, asynchronously. It can take several seconds (even up to approximately 30 seconds) before these are in place after creating the connection.
You may therefore need to wait before you'll be able to create the assets below within the connection.
To confirm access, retrieve the connection after it has been created. The SDKs' retry loops will automatically retry until the connection can be successfully retrieved. At that point, your API token has permission to create the other assets.
Note: if you are reusing an existing connection rather than creating one via your API token, you must give your API token a persona that has access to that connection. Otherwise all attempts to create, read, update, or delete assets within that connection will fail due to a lack of permissions.
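If you call the REST API directly rather than through an SDK, you may need your own wait loop. The sketch below is a generic polling helper, not the SDKs' actual retry logic. The `fetch` callable is hypothetical: it stands in for whatever call retrieves the connection, and it is assumed to return `None` or raise `PermissionError` until the policies are in place.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def wait_until_retrievable(
    fetch: Callable[[], Optional[T]],
    timeout_s: float = 60.0,
    interval_s: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Poll fetch() until it returns a value or the timeout elapses."""
    deadline = time.monotonic() + timeout_s
    while True:
        try:
            result = fetch()
        except PermissionError:
            # Assumed signal that the access policies are not in place yet.
            result = None
        if result is not None:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError("connection was not retrievable before the timeout")
        sleep(interval_s)
```

The default 60-second timeout is an arbitrary choice that leaves headroom over the roughly 30 seconds mentioned above.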
ADLSAccount
An Azure Data Lake Storage account requires a `name` and a `qualifiedName`. For creation, you also need to specify the `connectionQualifiedName` of the connection for the account.
Create an ADLS account (Java)
- Build up the minimum request to create an account.
- Provide a human-readable name for your account.
- Provide the `qualifiedName` of the connection for this account.
- Actually call Atlan to create the account.
- Retrieve the created account for use in subsequent creation calls. (You'd probably want to do some null checking first.)
Create an ADLS account (Python)
- Build up the minimum request to create an account.
- Provide a human-readable name for your account.
- Provide the `qualified_name` of the connection for this account.
- Actually call Atlan to create the account.
- Retrieve the created account for use in subsequent creation calls. (You'd probably want to check for `None` first.)
POST /api/meta/entity/bulk
- The `typeName` must be exactly `ADLSAccount`.
- Human-readable name for your account.
- The `qualifiedName` should follow the pattern: `default/adls/<epoch>/<name>`, where `default/adls/<epoch>` is the `qualifiedName` of the connection for this account and `<name>` is the unique name for this account.
- The `connectionQualifiedName` must be the exact `qualifiedName` of the connection for this account.
- The `connectorName` must be exactly `adls`.
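The account annotations above can be sketched as a small payload builder. As before, the attribute names and values follow the annotations. The `entities` and `attributes` envelope is an assumption, and the function name is hypothetical.

```python
from typing import Any, Dict


def build_adls_account_payload(
    name: str, connection_qualified_name: str
) -> Dict[str, Any]:
    """Build a request body for creating one ADLSAccount."""
    return {
        # Assumed envelope: a list of entities, each with typeName + attributes.
        "entities": [
            {
                "typeName": "ADLSAccount",
                "attributes": {
                    "name": name,
                    # Pattern: default/adls/<epoch>/<name>
                    "qualifiedName": f"{connection_qualified_name}/{name}",
                    "connectionQualifiedName": connection_qualified_name,
                    "connectorName": "adls",
                },
            }
        ]
    }
```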
ADLSContainer
An Azure Data Lake Storage container requires a `name` and a `qualifiedName`. For creation, you also need to specify the account that will contain the container.
Create an ADLS container (Java)
- Build up the minimum request to create a container.
- Provide a human-readable name for your container.
- Provide the account for this container. If you did not already have the object, you could also use `ADLSAccount.refByGuid()` with the GUID of the account, or `ADLSAccount.refByQualifiedName()` with the `qualifiedName` of the account.
- (Optional) To ensure the UI displays the correct count of `ADLSObject`s, set the `adlsObjectCount` directly on the `ADLSContainer` instance.
- Actually call Atlan to create the container.
- Retrieve the created container for use in subsequent creation calls. (You'd probably want to do some null checking first.)
Create an ADLS container (Python)
- Build up the minimum request to create a container.
- Provide a human-readable name for your container.
- Provide the `qualified_name` of the ADLS account for this container.
- (Optional) To ensure the UI displays the correct count of `ADLSObject`s, set the `adls_object_count` directly on the `ADLSContainer` instance.
- Actually call Atlan to create the container.
- Retrieve the created container for use in subsequent creation calls. (You'd probably want to check for `None` first.)
POST /api/meta/entity/bulk
- The `typeName` must be exactly `ADLSContainer`.
- Human-readable name for your container.
- The `qualifiedName` should follow the pattern: `default/adls/<epoch>/<account>/<name>`, where `default/adls/<epoch>/<account>` is the `qualifiedName` of the account for this container and `<name>` is the unique name for this container.
- The `connectionQualifiedName` must be the exact `qualifiedName` of the connection for this container.
- The `connectorName` must be exactly `adls`.
- The account in which this container exists is embedded in the `adlsAccount` attribute.
- The `typeName` for this embedded reference must be `ADLSAccount`.
- To complete the reference, you must include a `uniqueAttributes` object with the `qualifiedName` of the account. Note: the account must already exist in Atlan before creating the container.
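A sketch of the container payload, including the embedded account reference, might look like the following. The envelope shape, the placement of `adlsAccount` inside `attributes`, and the function name are assumptions. Taking the first three path segments as the connection's `qualifiedName` follows the `default/adls/<epoch>/<account>` pattern described above.

```python
from typing import Any, Dict


def build_adls_container_payload(
    name: str, account_qualified_name: str
) -> Dict[str, Any]:
    """Build a request body for creating one ADLSContainer."""
    parts = account_qualified_name.split("/")
    if len(parts) < 4:
        raise ValueError("expected default/adls/<epoch>/<account>")
    # The connection's qualifiedName is the first three segments.
    connection_qn = "/".join(parts[:3])
    return {
        # Assumed envelope: a list of entities, each with typeName + attributes.
        "entities": [
            {
                "typeName": "ADLSContainer",
                "attributes": {
                    "name": name,
                    "qualifiedName": f"{account_qualified_name}/{name}",
                    "connectionQualifiedName": connection_qn,
                    "connectorName": "adls",
                    # Embedded reference to the (already existing) account.
                    "adlsAccount": {
                        "typeName": "ADLSAccount",
                        "uniqueAttributes": {
                            "qualifiedName": account_qualified_name
                        },
                    },
                },
            }
        ]
    }
```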
ADLSObject
An Azure Data Lake Storage object requires a `name` and a `qualifiedName`. For creation, you also need to specify the `containerQualifiedName` of the container that will contain the object.
Create an ADLS object (Java)
- Build up the minimum request to create an object.
- Provide a human-readable name for your object.
- Provide the container for this object. If you did not already have the container, you could also use `ADLSContainer.refByGuid()` with the GUID of the container, or `ADLSContainer.refByQualifiedName()` with the `qualifiedName` of the container.
- Actually call Atlan to create the object.
Create an ADLS object (Python)
- Build up the minimum request to create an object.
- Provide a human-readable name for your object.
- Provide the `qualified_name` of the ADLS container.
- Actually call Atlan to create the object.
POST /api/meta/entity/bulk
- The `typeName` must be exactly `ADLSObject`.
- Human-readable name for your object.
- The `qualifiedName` should follow the pattern: `default/adls/<epoch>/<account>/<container>/<name>`, where `default/adls/<epoch>/<account>/<container>` is the `qualifiedName` of the container for this object and `<name>` is the unique name for this object.
- The `connectionQualifiedName` must be the exact `qualifiedName` of the connection for this object.
- The `adlsAccountQualifiedName` must be the exact `qualifiedName` of the ADLS account.
- The `connectorName` must be exactly `adls`.
- The container in which this object exists is embedded in the `adlsContainer` attribute.
- The `typeName` for this embedded reference must be `ADLSContainer`.
- To complete the reference, you must include a `uniqueAttributes` object with the `qualifiedName` of the container. Note: the container must already exist in Atlan before creating the object.
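The object payload can be sketched the same way. The envelope shape, the placement of `adlsContainer` inside `attributes`, and the function name are assumptions. The sketch also assumes that account and container names contain no `/`, so that the connection and account `qualifiedName` values can be read off the leading path segments.

```python
from typing import Any, Dict


def build_adls_object_payload(
    name: str, container_qualified_name: str
) -> Dict[str, Any]:
    """Build a request body for creating one ADLSObject."""
    parts = container_qualified_name.split("/")
    if len(parts) < 5:
        raise ValueError("expected default/adls/<epoch>/<account>/<container>")
    connection_qn = "/".join(parts[:3])  # default/adls/<epoch>
    account_qn = "/".join(parts[:4])     # .../<account>
    return {
        # Assumed envelope: a list of entities, each with typeName + attributes.
        "entities": [
            {
                "typeName": "ADLSObject",
                "attributes": {
                    "name": name,
                    "qualifiedName": f"{container_qualified_name}/{name}",
                    "connectionQualifiedName": connection_qn,
                    "adlsAccountQualifiedName": account_qn,
                    "connectorName": "adls",
                    # Embedded reference to the (already existing) container.
                    "adlsContainer": {
                        "typeName": "ADLSContainer",
                        "uniqueAttributes": {
                            "qualifiedName": container_qualified_name
                        },
                    },
                },
            }
        ]
    }
```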
Available relationships
Every level of the object store structure is an `Asset`, and can therefore be related to the following other assets.
erDiagram
Asset }o--o{ AtlasGlossaryTerm : meanings
Asset ||--o{ Link : links
Asset ||--o| Readme : readme
Asset }o--o{ Process : inputToProcesses
Asset }o--o{ Process : outputFromProcesses
AtlasGlossaryTerm
A glossary term provides meaning to an asset. The link terms to assets snippet provides more detail on setting this relationship.
Link
A link provides additional context to an asset, by providing a URL to additional information.
Readme
A README provides rich documentation for an asset. The add asset READMEs snippet provides more detail on setting this relationship.
Process
A process provides lineage information for an asset. An asset can be both an input and an output for one or more processes. The lineage snippets provide more detail on creating and working with lineage.
[1] Although if you want to delete everything in a connection, your better avenue is the packaged connection delete utility in the UI.