Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding S3 Support for Artifacts #34

Open
wants to merge 11 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions Project.toml
Original file line number Diff line number Diff line change
Expand Up @@ -4,10 +4,13 @@ authors = ["@deyandyankov and contributors"]
version = "0.4.4"

[deps]
AWS = "fbe9abb3-538b-5e4e-ba9e-bc94f4f92ebc"
AWSS3 = "1c724243-ef5b-51ab-93f4-b0a88ac62a95"
Dates = "ade2ca70-3891-5945-98fb-dc099432e06a"
FilePathsBase = "48062228-2e41-5def-b9a4-89aafe57970f"
HTTP = "cd3eb016-35fb-5094-929b-558a96fad6f3"
JSON = "682c06a0-de6a-54ab-a142-c8b1cf79cde6"
Minio = "4281f0d9-7ae0-406e-9172-b7277c1efa20"
ShowCases = "605ecd9f-84a6-4c9e-81e2-4798472b76a3"
URIs = "5c2747f8-b7ea-4ff2-ba2e-563bfd36b1d4"
UUIDs = "cf7118a7-6976-5b1a-9a39-7adc72f591a4"
Expand Down
4 changes: 4 additions & 0 deletions src/MLFlowClient.jl
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,10 @@ using URIs
using JSON
using ShowCases
using FilePathsBase: AbstractPath
using AWSS3
using Minio
using AWS: AbstractAWSConfig, AWSConfig, AWSCredentials, global_aws_config


include("types/core.jl")
export
Expand Down
59 changes: 47 additions & 12 deletions src/loggers.jl
Original file line number Diff line number Diff line change
Expand Up @@ -74,7 +74,7 @@
# Arguments
- `mlf::MLFlow`: [`MLFlow`](@ref) onfiguration. Currently not used, but when this method is extended to support `S3`, information from `mlf` will be needed.
- `run`: one of [`MLFlowRun`](@ref), [`MLFlowRunInfo`](@ref) or `String`.
- `basefilename`: name of the file to be written.
- `basefilename`: name of the file to be written; can contain a folder such as `model/mycode.jl` which the folder will be created in the artifact directory.
- `data`: artifact content, an object that can be written directly to a file handle.

# Throws
Expand All @@ -83,19 +83,54 @@
# Returns
path of the artifact that was created.
"""
function logartifact(mlf::MLFlow, run_id::AbstractString, basefilename::AbstractString, data)
function logartifact(mlf::MLFlow, run_id::AbstractString, basefilename::AbstractString, data; artifact_path="")
mlflowrun = getrun(mlf, run_id)
artifact_uri = mlflowrun.info.artifact_uri
mkpath(artifact_uri)
filepath = joinpath(artifact_uri, basefilename)
try
f = open(filepath, "w")
write(f, data)
close(f)
catch e
error("Unable to create artifact $(filepath): $e")
artifact_uri = joinpath(mlflowrun.info.artifact_uri,artifact_path,dirname(basefilename))
basefilename = basename(basefilename)

if !startswith(artifact_uri, "s3://")
mkpath(artifact_uri)
filepath = joinpath(artifact_uri, basefilename)
try
open(filepath, "w") do f
write(f, data)
end
catch e
error("Unable to create artifact $(filepath): $e")
end
else
region = get(ENV, "AWS_REGION", "") # Optional, defaults to empty if not set

Check warning on line 102 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L102

Added line #L102 was not covered by tests

if region == ""
region = get(ENV, "AWS_DEFAULT_REGION", "")

Check warning on line 105 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L104-L105

Added lines #L104 - L105 were not covered by tests
end

if haskey(ENV, "MLFLOW_S3_ENDPOINT_URL")
s3creds = AWSCredentials()
s3config = MinioConfig(ENV["MLFLOW_S3_ENDPOINT_URL"], s3creds; region=region)

Check warning on line 110 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L108-L110

Added lines #L108 - L110 were not covered by tests
else
s3config = global_aws_config() # default AWS configuration

Check warning on line 112 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L112

Added line #L112 was not covered by tests
end

filepath = joinpath(artifact_uri, basefilename)
artifact_uri = rstrip(artifact_uri[6:end], '/')# get rid of s3:// so s3_put doesnt' complain

Check warning on line 116 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L115-L116

Added lines #L115 - L116 were not covered by tests

try

Check warning on line 118 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L118

Added line #L118 was not covered by tests
#TODO: Figure out the correct IO stream way of doing this
open(joinpath("/tmp/",basefilename), "w") do f
write(f, data)

Check warning on line 121 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L120-L121

Added lines #L120 - L121 were not covered by tests
end
open(joinpath("/tmp/",basefilename), "r") do f

Check warning on line 123 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L123

Added line #L123 was not covered by tests
file_data = read(f)
s3_put(s3config, artifact_uri, basefilename, file_data)
end
rm(joinpath("/tmp",basefilename))

Check warning on line 127 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L127

Added line #L127 was not covered by tests
catch e
error("Unable to upload artifact to S3 $(filepath): $e")

Check warning on line 129 in src/loggers.jl

View check run for this annotation

Codecov / codecov/patch

src/loggers.jl#L129

Added line #L129 was not covered by tests
end
end
filepath

return filepath
end
logartifact(mlf::MLFlow, run::MLFlowRun, basefilename::AbstractString, data) =
logartifact(mlf, run.info, basefilename, data)
Expand Down
Loading