list_data_sources
Browse and search data sources in an Amazon DataZone domain by project, type, status, or name. Use filters to find and select data sources without knowing their exact IDs.
Instructions
Retrieve a list of data sources in an Amazon DataZone domain.
Use this API when the user is browsing, searching, or filtering data sources — especially if they don't know the exact ID or want to find a list to choose from.
This is not the correct API if the user asks for the configuration details of a known data source; use get_data_source in that case.
Related tools: get_data_source retrieves detailed information about a known data source. Use it to fetch the connection details, authentication settings, or ingestion configuration of a particular data source.
Args:
- domainIdentifier (str): The identifier of the Amazon DataZone domain in which to list the data sources. Pattern: ^dzd[-][a-zA-Z0-9-]{1,36}$ Required: Yes
- projectIdentifier (str): The identifier of the project in which to list data sources. Required: Yes
- connectionIdentifier (str, optional): The ID of the connection used to filter the data sources.
- environmentIdentifier (str, optional): The identifier of the environment in which to list the data sources.
- maxResults (int, optional): The maximum number of data sources to return in one response. Valid Range: 1–50
- name (str, optional): Filter by name of the data source. Length Constraints: 1–256 characters
- nextToken (str, optional): A pagination token for fetching the next set of results. Length Constraints: 1–8192 characters
- status (str, optional): Filter data sources by their current status. Valid values:
  - CREATING
  - FAILED_CREATION
  - READY
  - UPDATING
  - FAILED_UPDATE
  - RUNNING
  - DELETING
  - FAILED_DELETION
- type (str, optional): Filter by the type of data source (e.g., GLUE, REDSHIFT). Length Constraints: 1–256 characters
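The constraints in the argument list can be checked on the client side before a request is sent. The following is a minimal sketch, not part of the tool itself: `validate_args` is a hypothetical helper, and it assumes the arguments arrive as a dict keyed by the camelCase names listed above.

```python
import re

# Constraints copied from the argument list above.
DOMAIN_ID_PATTERN = re.compile(r"^dzd[-][a-zA-Z0-9-]{1,36}$")
VALID_STATUSES = frozenset({
    "CREATING", "FAILED_CREATION", "READY", "UPDATING",
    "FAILED_UPDATE", "RUNNING", "DELETING", "FAILED_DELETION",
})


def validate_args(args):
    """Hypothetical helper: return a list of problems (empty if args look valid)."""
    problems = []

    domain_id = args.get("domainIdentifier")
    if not isinstance(domain_id, str) or not DOMAIN_ID_PATTERN.fullmatch(domain_id):
        problems.append("domainIdentifier must match ^dzd[-][a-zA-Z0-9-]{1,36}$")

    if not args.get("projectIdentifier"):
        problems.append("projectIdentifier is required")

    max_results = args.get("maxResults")
    if max_results is not None and not 1 <= max_results <= 50:
        problems.append("maxResults must be between 1 and 50")

    # Length limits for the optional string arguments.
    for key, upper in (("name", 256), ("type", 256), ("nextToken", 8192)):
        value = args.get(key)
        if value is not None and not 1 <= len(value) <= upper:
            problems.append(f"{key} must be 1-{upper} characters long")

    status = args.get("status")
    if status is not None and status not in VALID_STATUSES:
        problems.append(f"status must be one of {sorted(VALID_STATUSES)}")

    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every invalid argument at once.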
Returns:
dict: A dictionary with the following keys:
- items (List[dict]): A list of DataSourceSummary objects containing:
  - connectionId (str)
  - createdAt (str)
  - dataSourceId (str)
  - description (str)
  - domainId (str)
  - enableSetting (str)
  - environmentId (str)
  - lastRunAssetCount (int)
  - lastRunAt (str)
  - lastRunErrorMessage (dict): Contains "errorDetail" and "errorType"
  - lastRunStatus (str)
  - name (str)
  - schedule (dict): Contains "schedule" and "timezone"
  - status (str)
  - type (str)
  - updatedAt (str)
- nextToken (str): Token to retrieve the next page of results, if any.
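Because each response holds at most one page of items, collecting every data source means following nextToken until it is absent. The sketch below illustrates that loop under stated assumptions: `iter_data_sources` is a hypothetical helper, `list_fn` stands for any callable that accepts the arguments above and returns the dictionary described here, and `fake_list_data_sources` is an in-memory stand-in used only so the example runs without a DataZone domain.

```python
def iter_data_sources(list_fn, **params):
    """Yield every item across all pages by following nextToken."""
    next_token = None
    while True:
        request = dict(params)
        if next_token:
            request["nextToken"] = next_token
        page = list_fn(**request)
        yield from page.get("items", [])
        next_token = page.get("nextToken")
        if not next_token:
            break  # no further pages


def fake_list_data_sources(**kwargs):
    """Stand-in for the real call: serves two pages of one item each."""
    if kwargs.get("nextToken") == "page-2":
        return {"items": [{"dataSourceId": "ds-2", "name": "sales", "status": "READY"}]}
    return {
        "items": [{"dataSourceId": "ds-1", "name": "orders", "status": "READY"}],
        "nextToken": "page-2",
    }


# The identifiers here are placeholders, not real resources.
names = [
    ds["name"]
    for ds in iter_data_sources(
        fake_list_data_sources,
        domainIdentifier="dzd-example",
        projectIdentifier="example-project",
    )
]
```

Writing the loop as a generator lets a caller stop early, for example once a data source with the wanted name has been found, without fetching the remaining pages.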
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| name | No | ||
| status | No | ||
| next_token | No | ||
| max_results | No | ||
| data_source_type | No | ||
| domain_identifier | Yes | ||
| project_identifier | Yes | ||
| connection_identifier | No | ||
| environment_identifier | No | ||
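The input schema uses snake_case names, while the argument list in the instructions uses camelCase. The sketch below shows one way the two could line up. The mapping is an assumption inferred from the names alone (in particular, that data_source_type corresponds to type), and `to_api_args` is a hypothetical helper rather than part of the tool.

```python
# Assumed correspondence between input-schema fields and API arguments.
SCHEMA_TO_API = {
    "name": "name",
    "status": "status",
    "next_token": "nextToken",
    "max_results": "maxResults",
    "data_source_type": "type",
    "domain_identifier": "domainIdentifier",
    "project_identifier": "projectIdentifier",
    "connection_identifier": "connectionIdentifier",
    "environment_identifier": "environmentIdentifier",
}


def to_api_args(tool_input):
    """Rename schema fields to API argument names, dropping unset (None) values."""
    unknown = set(tool_input) - set(SCHEMA_TO_API)
    if unknown:
        raise ValueError(f"unknown input fields: {sorted(unknown)}")
    return {
        SCHEMA_TO_API[key]: value
        for key, value in tool_input.items()
        if value is not None
    }
```

For example, an input with domain_identifier, project_identifier, and data_source_type set would become a request with domainIdentifier, projectIdentifier, and type.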