- DagsterRunnow has a- job_nameproperty, which should be used instead of- pipeline_name.
- TimeWindowPartitionsDefinitionnow has a- get_partition_keys_in_rangemethod which returns a sequence of all the partition keys between two partition keys.
- OpExecutionContextnow has- asset_partitions_def_for_outputand- asset_partitions_def_for_inputmethods.
- Dagster now errors immediately with an informative message when two AssetsDefinitionobjects with the same key are provided to the same repository.
- build_output_contextnow accepts a- partition_keyargument that can be used when testing the- handle_outputmethod of an IO manager.
- Fixed a bug that made it impossible to load inputs using a DagsterTypeLoader if the InputDefinition had an asset_keyset.
- Ops created with the @assetand@multi_assetdecorators no longer have a top-level “assets” entry in their config schema. This entry was unused.
- In 0.15.6, a bug was introduced that made it impossible to load repositories if assets that had non-standard metadata attached to them were present. This has been fixed.
- [dagster-dbt] In some cases, using load_assets_from_dbt_manifestwith aselectparameter that included sources would result in an error. This has been fixed.
- [dagit] Fixed an error where a race condition of a sensor/schedule page load and the sensor/schedule removal caused a GraphQL exception to be raised.
- [dagit] The “Materialize” button no longer changes to “Rematerialize” in some scenarios
- [dagit] The live overlays on asset views, showing latest materialization and run info, now load faster
- [dagit] Typing whitespace into the launchpad Yaml editor no longer causes execution to fail to start
- [dagit] The explorer sidebar no longer displays “mode” label and description for jobs, since modes are deprecated.
- An error will now be raised if a @repositorydecorated function expects parameters. Thanks @roeij!
- The non-asset version of the Hacker News example, which lived inside examples/hacker_news/, has been removed, because it hadn’t received updates in a long time and had drifted from best practices. The asset version is still there and has an updated README. Check it out here
- When an exception is wrapped by another exception and raised within an op, Dagit will now display the full chain of exceptions, instead of stopping after a single exception level.
- A default_logger_defsargument has been added to the@repositorydecorator. Check out the docs on specifying default loggers to learn more.
- AssetsDefinition.from_graphand- AssetsDefinition.from_opnow both accept a- partition_mappingsargument.
- AssetsDefinition.from_graphand- AssetsDefinition.from_opnow both accept a- metadata_by_output_nameargument.
- define_asset_jobnow accepts an- executor_defargument.
- Removed package pin for gqlindagster-graphql.
- You can now apply a group name to assets produced with the @multi_assetdecorator, either by supplying agroup_nameargument (which will apply to all of the output assets), or by setting thegroup_nameargument on individualAssetOuts.
- InputContextand- OutputContextnow each have an- asset_partitions_defproperty, which returns the- PartitionsDefinitionof the asset that’s being loaded or stored.
- build_schedule_from_partitioned_jobnow raises a more informative error when provided a non-partitioned asset job
- PartitionMapping,- IdentityPartitionMapping,- AllPartitionMapping, and- LastPartitionMappingare exposed at the top-level- dagsterpackage. They're currently marked experimental.
- When a non-partitioned asset depends on a partitioned asset, you can now control which partitions of the upstream asset are used by the downstream asset, by supplying a PartitionMapping.
- You can now set PartitionMappingsonAssetIn.
- [dagit] Made performance improvements to the loading of the partitions and backfill pages.
- [dagit] The Global Asset Graph is back by popular demand, and can be reached via a new “View global asset lineage ”link on asset group and asset catalog pages! The global graph keeps asset in the same group visually clustered together and the query bar allows you to visualize a custom slice of your asset graph.
- [dagit] Simplified the Content Security Policy and removed frame-ancestorsrestriction.
- [dagster-dbt] load_assets_from_dbt_projectandload_assets_from_dbt_manifestnow support anode_info_to_group_name_fnparameter, allowing you to customize which group Dagster will assign each dbt asset to.
- [dagster-dbt] When you supply a runtime_metadata_fnwhen loading dbt assets, this metadata is added to the default metadata that dagster-dbt generates, rather than replacing it entirely.
- [dagster-dbt] When you load dbt assets with use_build_command=True, seeds and snapshots will now be represented as Dagster assets. Previously, only models would be loaded as assets.
- Fixed an issue where runs that were launched using the DockerRunLauncherwould sometimes use Dagit’s Python environment as the entrypoint to launch the run, even if that environment did not exist in the container.
- Dagster no longer raises a “Duplicate definition found” error when a schedule definition targets a partitioned asset job.
- Silenced some erroneous warnings that arose when using software-defined assets.
- When returning multiple outputs as a tuple, empty list values no longer cause unexpected exceptions.
- [dagit] Fixed an issue with graph-backed assets causing a GraphQL error when graph inputs were type-annotated.
- [dagit] Fixed an issue where attempting to materialize graph-backed assets caused a graphql error.
- [dagit] Fixed an issue where partitions could not be selected when materializing partitioned assets with associated resources.
- [dagit] Attempting to materialize assets with required resources now only presents the launchpad modal if at least one resource defines a config schema.
- An op with a non-optional DynamicOutput will now error if no outputs are returned or yielded for that dynamic output.
- If an Outputobject is used to type annotate the return of an op, an Output object must be returned or an error will result.
- Dagit now displays the path of the output handled by PickledObjectS3IOManagerin run logs and Asset view. Thanks @danielgafni
- The Hacker News example now uses stable 0.15+ asset APIs, instead of the deprecated 0.14.x asset APIs.
- Fixed the build command in the instructions for contributing docs changes.
- [dagster-dbt] The dagster-dbt integration guide now contains information on using dbt with Software-Defined Assets.
- Added documentation and helm chart configuration for threaded sensor evaluations.
- Added documentation and helm chart configuration for tick retention policies.
- Added descriptions for default config schema. Fields like execution, loggers, ops, and resources are now documented.
- UnresolvedAssetJob objects can now be passed to run status sensors.
- [dagit] A new global asset lineage view, linked from the Asset Catalog and Asset Group pages, allows you to view a graph of assets in all loaded asset groups and filter by query selector and repo.
- [dagit] A new option on Asset Lineage pages allows you to choose how many layers of the upstream / downstream graph to display.
- [dagit] Dagit's DAG view now collapses large sets of edges between the same ops for improved readability and rendering performance.
- Fixed a bug with materializethat would cause required resources to not be applied correctly.
- Fixed issue that caused repositories to fail to load when build_schedule_from_partitioned_jobanddefine_asset_jobwere used together.
- Fixed a bug that caused auto run retries to always use the FROM_FAILUREstrategy
- Previously, it was possible to construct Software-Defined Assets from graphs whose leaf ops were not mapped to assets. This is invalid, as these ops are not required for the production of any assets, and would cause confusing behavior or errors on execution. This will now result in an error at definition time, as intended.
- Fixed issue where the run monitoring daemon could mark completed runs as failed if they transitioned quickly between STARTING and SUCCESS status.
- Fixed stability issues with the sensor daemon introduced in 0.15.3 that caused the daemon to fail heartbeat checks if the sensor evaluation took too long.
- Fixed issues with the thread pool implementation of the sensor daemon where race conditions caused the sensor to fire more frequently than the minimum interval.
- Fixed an issue with storage implementations using MySQL server version 5.6 which caused SQL syntax exceptions to surface when rendering the Instance overview pages in Dagit.
- Fixed a bug with the default_executor_defargument on repository where asset jobs that defined executor config would result in errors.
- Fixed a bug where an erroneous exception would be raised if an empty list was returned for a list output of an op.
- [dagit] Clicking the "Materialize" button for assets with configurable resources will now present the asset launchpad.
- [dagit] If you have an asset group and no jobs, Dagit will display it by default rather than directing you to the asset catalog.
- [dagit] DAG renderings of software-defined assets now display only the last component of the asset's key for improved readability.
- [dagit] Fixes a regression where clicking on a source asset would trigger a GraphQL error.
- [dagit] Fixed issue where the “Unloadable” section on the sensors / schedules pages in Dagit were populated erroneously with loadable sensors and schedules
- [dagster-dbt] Fixed an issue where an exception would be raised when using the dbt build command with Software-Defined Assets if a test was defined on a source.
- Removed the deprecated dagster-daemon health-check CLI command
- TimeWindow is now exported from the dagster package (Thanks @nvinhphuc!)
- Added a fix to allow customization of slack messages (Thanks @solarisa21!)
- [dagster-databricks] The databricks_pyspark_step_launchernow allows you to configure the following (Thanks @Phazure!):- the aws_attributesof the cluster that will be spun up for the step.
- arbitrary environment variables to be copied over to databricks from the host machine, rather than requiring these variables to be stored as secrets.
- job and cluster permissions, allowing users to view the completed runs through the databricks console, even if they’re kicked off by a service account.
 
- [dagster-k8s] Added - k8s_job_opto launch a Kubernetes Job with an arbitrary image and CLI command. This is in contrast with the- k8s_job_executor, which runs each Dagster op in a Dagster job in its own k8s job. This op may be useful when you need to orchestrate a command that isn't a Dagster op (or isn't written in Python). Usage:
 - from dagster_k8s import k8s_job_op
my_k8s_op = k8s_job_op.configured({
 "image": "busybox",
 "command": ["/bin/sh", "-c"],
 "args": ["echo HELLO"],
 },
 name="my_k8s_op",
)
 
 
- [dagster-dbt] The dbt asset-loading functions now support - partitions_defand- partition_key_to_vars_fnparameters, adding preliminary support for partitioned dbt assets. To learn more, check out the Github issue!
 
- Reverted sensor threadpool changes from 0.15.3 to address daemon stability issues.
- When loading an upstream asset or op output as an input, you can now set custom loading behavior using the input_manager_key argument to AssetIn and In
- The list of objects returned by a repository can now contain nested lists.
- Added a data retention instance setting in dagster.yaml that enables the automatic removal of sensor/schedule ticks after a certain number of days.
- Added a sensor daemon setting in dagster.yaml that enables sensor evaluations to happen in a thread pool to increase throughput.
- materialize_to_memoryand materialize now both have the partition_key argument.
- Outputand- DynamicOutputobjects now work with deep equality checks:
Output(value=5, name="foo") == Output(value=5, name="foo") 
- RunRequests can now be returned from run status sensors
- Added resource_defsargument toAssetsDefinition.from_graph. Allows for specifying resources required by constituent ops directly on the asset.
- When adding a tag to the Run search filter in Dagit by clicking the hover menu on the tag, the tag will now be appended to the filter instead of replacing the entire filter state.
- [dagster-dbt] An exception is now emitted if you attempt to invoke the library without having dbt-core installed. dbt-core is now also added as a dependency to the library.
- Asset group names can now contain reserved python keywords
- Fixed a run config parsing bug that was introduced in 0.15.1that caused Dagit to interpret datetime strings as datetime objects and octal strings as integers.
- Runs that have failed to start are now represented in the Instance Timeline view on Dagit.
- Fixed an issue where the partition status was missing for partitioned jobs that had no runs.
- Fixed a bug where op/resource invocation would error when resources were required, no context was used in the body of the function, and no context was provided when invoking.
- [dagster-databricks] Fixed an issue where an exception related to the deprecated prior_attempts_count field when using the databricks_pyspark_step_launcher.
- [dagster-databricks] Polling information logged from the databricks_pyspark_step_launcher is now emitted at the DEBUG level instead of INFO.
- In the yaml editor in Dagit, the typeahead feature now correctly shows suggestions for nullable schema types.
- When editing asset configuration in Dagit, the “Scaffold config” button in the Dagit launchpad sometimes showed the scaffold dialog beneath the launchpad. This has been fixed.
- A recent change added execution timezones to some human-readable cron strings on schedules in Dagit. This was added incorrectly in some cases, and has now been fixed.
- In the Dagit launchpad, a config state containing only empty newlines could lead to an error that could break the editor. This has been fixed.
- Fixed issue that could cause partitioned graph-backed assets to attempt to load upstream inputs from the incorrect path when using the fs_io_manager (or other similar io managers).
- [dagster-dbt] Fixed issue where errors generated from issuing dbt cli commands would only show json-formatted output, rather than a parsed, human-readable output.
- [dagster-dbt] By default, dagster will invoke the dbt cli with a --log-format json flag. In some cases, this may cause dbt to report incorrect or misleading error messages. As a workaround, it is now possible to disable this behavior by setting the json_log_format configuration option on the dbt_cli_resource to False.
- materialize_to_memory erroneously allowed non-in-memory io managers to be used. Now, providing io managers to materialize_to_memory will result in an error, and mem_io_manager will be provided to all io manager keys.