Skip to content

Commit

Permalink
Backport multiple PRs to 2.x from main (#1662)
Browse files Browse the repository at this point in the history
* add triaging doc (#1250)

Signed-off-by: Yaliang Wu <[email protected]>
Signed-off-by: Bhavana Ramaram <[email protected]>

* Fix missing lombok version compilation failure issue (#1278)

* Fix missing lombok version compilation failure issue

Signed-off-by: zane-neo <[email protected]>
Signed-off-by: Bhavana Ramaram <[email protected]>

* Updated developer guide to include links for getting started with git (#1409)

Signed-off-by: Divit Rawal <[email protected]>
Signed-off-by: Bhavana Ramaram <[email protected]>

* add Austin and Henry as maintainer (#1579)

Signed-off-by: Yaliang Wu <[email protected]>
Signed-off-by: Bhavana Ramaram <[email protected]>

* Apply spotless to ml-algorithms package (#1610)

Signed-off-by: Tyler Ohlsen <[email protected]>
Signed-off-by: Bhavana Ramaram <[email protected]>

* fix format violations

Signed-off-by: Bhavana Ramaram <[email protected]>

---------

Signed-off-by: Yaliang Wu <[email protected]>
Signed-off-by: Bhavana Ramaram <[email protected]>
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: Divit Rawal <[email protected]>
Signed-off-by: Tyler Ohlsen <[email protected]>
Co-authored-by: Yaliang Wu <[email protected]>
Co-authored-by: zane-neo <[email protected]>
Co-authored-by: Divit Rawal <[email protected]>
Co-authored-by: Tyler Ohlsen <[email protected]>
  • Loading branch information
5 people authored Nov 27, 2023
1 parent df5e065 commit 5be107b
Show file tree
Hide file tree
Showing 95 changed files with 3,041 additions and 2,073 deletions.
3 changes: 1 addition & 2 deletions .github/CODEOWNERS
Validating CODEOWNERS rules …
Original file line number Diff line number Diff line change
@@ -1,2 +1 @@
# This should match the owning team set up in https://github.com/orgs/opensearch-project/teams
* @opensearch-project/machine-learning
* @b4sjoo @dhrubo-os @jngz-es @model-collapse @rbhavna @wujunshen @ylwu-amzn @zane-neo @Zhangxunmt @austintlee @HenryL27
4 changes: 4 additions & 0 deletions DEVELOPER_GUIDE.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,6 +46,10 @@ If you already have these installed or have your own preferences for installing

If you don't already have it installed (check with `git --version`) we recommend following [the `git` installation guide for your OS](https://git-scm.com/downloads).

Resources to get started with git:
- [Learn git branching](https://learngitbranching.js.org/)
- [Contributing your first PR](https://gist.github.com/ashwin-pc/84ee8bd937caecd6a193e2a6a4c4fe34)

#### Install `Java`

You can install any version of Java starting from 17. [`Jenv`](https://www.jenv.be/) is a good option to use so that you can have multiple versions of Java.
Expand Down
27 changes: 20 additions & 7 deletions MAINTAINERS.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,10 +4,23 @@ This document contains a list of maintainers in this repo. See [opensearch-proje

## Current Maintainers

| Maintainer | GitHub ID | Affiliation |
|----------------------|---------------------------------------------|-------------|
| Bhavana Goud Ramaram | [rbhavna](https://github.com/rbhavna) | Amazon |
| Jing Zhang | [jngz-es](https://github.com/jngz-es) | Amazon |
| Sicheng Song | [b4sjoo](https://github.com/b4sjoo) | Amazon |
| Xun Zhang | [Zhangxunmt](https://github.com/Zhangxunmt) | Amazon |
| Yaliang Wu | [ylwu-amzn](https://github.com/ylwu-amzn) | Amazon |
| Maintainer | GitHub ID | Affiliation |
|----------------------|-----------------------------------------------------|-------------|
| Bhavana Goud Ramaram | [rbhavna](https://github.com/rbhavna) | Amazon |
| Charlie Yang | [model-collapse](https://github.com/model-collapse) | Amazon |
| Dhrubo Saha | [dhrubo-os](https://github.com/dhrubo-os) | Amazon |
| Jing Zhang | [jngz-es](https://github.com/jngz-es) | Amazon |
| Junshen Wu | [wujunshen](https://github.com/wujunshen) | Amazon |
| Sicheng Song | [b4sjoo](https://github.com/b4sjoo) | Amazon |
| Xun Zhang | [Zhangxunmt](https://github.com/Zhangxunmt) | Amazon |
| Yaliang Wu | [ylwu-amzn](https://github.com/ylwu-amzn) | Amazon |
| Zan Niu | [zane-neo](https://github.com/zane-neo) | Amazon |
| Austin Lee | [austintlee](https://github.com/austintlee) | Aryn |
| Henry Lindeman | [HenryL27](https://github.com/HenryL27) | Aryn |

## Emeritus

| Maintainer | GitHub ID | Affiliation |
| ----------- | ------------------------------------------------- | ----------- |
| Jackie Han | [jackiehanyang](https://github.com/jackiehanyang) | Amazon |
| Mingshi Liu | [mingshl](https://github.com/mingshl) | Amazon |
75 changes: 75 additions & 0 deletions TRIAGING.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,75 @@

<img src="https://opensearch.org/assets/img/opensearch-logo-themed.svg" height="64px">


The maintainers of the OpenSearch ml-commons Repo seek to promote an inclusive and engaged community of contributors. In order to facilitate this, weekly triage meetings are open-to-all and attendance is encouraged for anyone who hopes to contribute, discuss an issue, or learn more about the project. To learn more about contributing to the OpenSearch ml-commons Repo visit the [Contributing](https://github.com/opensearch-project/ml-commons/blob/main/CONTRIBUTING.md) documentation.

### Do I need to attend for my issue to be addressed/triaged?

Attendance is not required for your issue to be triaged or addressed. All new issues are triaged weekly.

### What happens if my issue does not get covered this time?

Each meeting we seek to address all new issues. However, should we run out of time before your issue is discussed, you are always welcome to attend the next meeting or to follow up on the issue post itself.

### How do I join the Backlog & Triage meeting?

Meetings are hosted regularly at 9AM PST time and can be joined via the links posted on the [Upcoming Events](https://opensearch.org/events) webpage.

After joining the Zoom meeting, you can enable your video / voice to join the discussion. If you do not have a webcam or microphone available, you can still join in via the text chat.

If you have an issue you'd like to bring forth please consider getting a link to the issue so it can be presented to everyone in the meeting.

### Is there an agenda for each week?

Meetings are lightly structured as follows:

1. Announcements: If there are any announcements to be made they will happen at the start of the meeting.
2. Review of new issues: The meetings always start with reviewing all untriaged issues for the [ml-commons](https://github.com/opensearch-project/ml-commons/issues?q=is%3Aopen+is%3Aissue+label%3Auntriaged) and [ml-commons-dashboards](https://github.com/opensearch-project/ml-commons-dashboards/issues?q=is%3Aissue+is%3Aopen+label%3Auntriaged) repositories.
3. Untriaged items: Review any issues that might have had the 'untriaged' label removed but require additional triage discussion.
4. Pull request discussion: Then, we review the status of outstanding pull requests from the ml-commons and ml-commons-dashboards repositories.
5. Open discussion: Finally, we open the floor in case anyone wants to highlight an issue.

There is no specific ordering within each category.
If you have an issue you would like to discuss but do not have the ability to attend the entire meeting please attend when is best for you and signal that you have an issue to discuss when you arrive.

### Do I need to have already contributed to the project to attend a triage meeting?

No, all are welcome and encouraged to attend. Attending the Backlog & Triage meetings is a great way for a new contributor to learn about the project as well as explore different avenues of contribution.

### What if I have an issue that is almost a duplicate, should I open a new one to be triaged?

You can always open an issue including one that you think may be a duplicate. However, in cases where you believe there is an important distinction to be made between an existing issue and your newly created one, you are encouraged to attend the triaging meeting to explain.

### What if I have follow-up questions on an issue?

If you have an existing issue you would like to discuss, you can always comment on the issue itself. Alternatively, you are welcome to come to the triage meeting to discuss.

### Is this meeting a good place to get help setting up ML features on my OpenSearch instance?

While we are always happy to help the community, the best resource for implementation questions is [the OpenSearch forum](https://forum.opensearch.org/c/plugins/ml/46).
There you can find answers to many common questions as well as speak with implementation experts.

### What are the issue labels associated with triaging?

Yes, there are several labels that are used to identify the 'state' of issues filed in OpenSearch and the plugins.

|Label |When applied |Meaning |
|--- |--- |--- |
|Untriaged |When issues are created or re-opened. |Issues labeled as 'Untriaged' require the attention of the repository maintainers and may need to be prioritized for quicker resolution. It's crucial to keep the count of 'Untriaged' labels low to ensure all potential issues are addressed in a timely manner. |
|Triaged |During triage meetings. |Issues labeled as 'Triaged' have been reviewed and are deemed actionable. Opening a pull request for an issue with the 'Triaged' label has a higher likelihood of approval from the project maintainers, particularly in novel areas. |
|Neither Label |During triage meetings. |This category is for issues that lack sufficient details to formulate a potential solution. Until more details are provided, it's difficult to ascertain if a proposed solution would be acceptable. When dealing with an 'Untriaged' issue that falls into this category, the triage team should provide further insights so the issue can be appropriately closed or labeled as 'Triaged'. Issues in this state are reviewed during every triage meeting. |
|Help Wanted |Anytime. |Issues marked as 'Help Wanted' signal that they are actionable and not the current focus of the project maintainers. Community contributions are especially encouraged for these issues. |
|Good First Issue |Anytime. |Issues labeled as 'Good First Issue' are small in scope and can be resolved with a single pull request. These are recommended starting points for newcomers looking to make their first contributions. |

### What if my issue is critical to OpenSearch operations, do I have to wait for the weekly meeting for it to be addressed?

All new issues for the ml-commons repo and ml-commons-dashboards repo are reviewed daily to check for critical issues which require immediate triaging. If an issue relates to a severe concern for OpenSearch operation, it will be triaged by a maintainer mid-week. You can still come to discuss an issue at the following meeting even if it has already been triaged during the week.

### Is this where I should bring up potential security vulnerabilities?

Due to the sensitive nature of security vulnerabilities, please report all potential vulnerabilities directly by following the steps outlined on the [SECURITY.md](https://github.com/opensearch-project/security/blob/main/SECURITY.md) document.

### Who should I contact if I have further questions?

You can always file an issue for any question you have about the project. Alternatively, you can reach out to specific contacts helping to organize the project: Dhrubo Saha ([[email protected]](mailto:[email protected])), Jing Zhang ([[email protected]](mailto:[email protected])), Xun Zhang ([[email protected]](mailto:[email protected]))and Yaliang Wu ([[email protected]](mailto:[email protected])).
10 changes: 10 additions & 0 deletions ml-algorithms/build.gradle
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,7 @@ plugins {
id 'java'
id 'jacoco'
id "io.freefair.lombok"
id 'com.diffplug.spotless' version '6.18.0'
}

repositories {
Expand Down Expand Up @@ -96,3 +97,12 @@ jacocoTestCoverageVerification {
dependsOn jacocoTestReport
}
check.dependsOn jacocoTestCoverageVerification

spotless {
java {
removeUnusedImports()
importOrder 'java', 'javax', 'org', 'com'

eclipse().configFile rootProject.file('.eclipseformat.xml')
}
}
24 changes: 12 additions & 12 deletions ml-algorithms/src/main/java/org/opensearch/ml/engine/MLEngine.java
Original file line number Diff line number Diff line change
Expand Up @@ -5,23 +5,25 @@

package org.opensearch.ml.engine;

import lombok.Getter;
import lombok.extern.log4j.Log4j2;
import java.nio.file.Path;
import java.util.Locale;
import java.util.Map;

import org.opensearch.ml.common.FunctionName;
import org.opensearch.ml.common.MLModel;
import org.opensearch.ml.common.dataframe.DataFrame;
import org.opensearch.ml.common.dataset.DataFrameInputDataset;
import org.opensearch.ml.common.dataset.MLInputDataset;
import org.opensearch.ml.common.input.Input;
import org.opensearch.ml.common.input.parameter.MLAlgoParams;
import org.opensearch.ml.common.input.MLInput;
import org.opensearch.ml.common.input.parameter.MLAlgoParams;
import org.opensearch.ml.common.model.MLModelFormat;
import org.opensearch.ml.common.output.MLOutput;
import org.opensearch.ml.common.output.Output;
import org.opensearch.ml.engine.encryptor.Encryptor;
import java.nio.file.Path;
import java.util.Locale;
import java.util.Map;

import lombok.Getter;
import lombok.extern.log4j.Log4j2;

/**
* This is the interface to all ml algorithms.
Expand Down Expand Up @@ -92,10 +94,7 @@ public Path getDeployModelRootPath() {
}

public Path getDeployModelChunkPath(String modelId, Integer chunkNumber) {
return mlModelsCachePath.resolve(DEPLOY_MODEL_FOLDER)
.resolve(modelId)
.resolve("chunks")
.resolve(chunkNumber + "");
return mlModelsCachePath.resolve(DEPLOY_MODEL_FOLDER).resolve(modelId).resolve("chunks").resolve(chunkNumber + "");
}

public Path getModelCachePath(String modelId, String modelName, String version) {
Expand Down Expand Up @@ -145,7 +144,8 @@ public MLOutput predict(Input input, MLModel model) {
public MLOutput trainAndPredict(Input input) {
validateMLInput(input);
MLInput mlInput = (MLInput) input;
TrainAndPredictable trainAndPredictable = MLEngineClassLoader.initInstance(mlInput.getAlgorithm(), mlInput.getParameters(), MLAlgoParams.class);
TrainAndPredictable trainAndPredictable = MLEngineClassLoader
.initInstance(mlInput.getAlgorithm(), mlInput.getParameters(), MLAlgoParams.class);
if (trainAndPredictable == null) {
throw new IllegalArgumentException("Unsupported algorithm: " + mlInput.getAlgorithm());
}
Expand Down Expand Up @@ -180,7 +180,7 @@ private void validateMLInput(Input input) {
throw new IllegalArgumentException("Input data set should not be null");
}
if (inputDataset instanceof DataFrameInputDataset) {
DataFrame dataFrame = ((DataFrameInputDataset)inputDataset).getDataFrame();
DataFrame dataFrame = ((DataFrameInputDataset) inputDataset).getDataFrame();
if (dataFrame == null || dataFrame.size() == 0) {
throw new IllegalArgumentException("Input data frame should not be null or empty");
}
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -5,15 +5,6 @@

package org.opensearch.ml.engine;

import org.apache.commons.beanutils.BeanUtils;
import org.apache.logging.log4j.LogManager;
import org.apache.logging.log4j.Logger;
import org.opensearch.ml.common.exception.MLException;
import org.opensearch.ml.common.FunctionName;
import org.opensearch.ml.engine.annotation.ConnectorExecutor;
import org.opensearch.ml.engine.annotation.Function;
import org.reflections.Reflections;

import java.lang.reflect.Constructor;
import java.security.AccessController;
import java.security.PrivilegedActionException;
Expand All @@ -22,6 +13,14 @@
import java.util.Map;
import java.util.Set;

import org.apache.commons.beanutils.BeanUtils;
import org.apache.logging.log4j.LogManager;
import org.apache.logging.log4j.Logger;
import org.opensearch.ml.common.FunctionName;
import org.opensearch.ml.common.exception.MLException;
import org.opensearch.ml.engine.annotation.ConnectorExecutor;
import org.opensearch.ml.engine.annotation.Function;
import org.reflections.Reflections;

public class MLEngineClassLoader {

Expand Down Expand Up @@ -138,7 +137,7 @@ public static <T, S, I extends Object> S initInstance(T type, I in, Class<?> con
} catch (Exception e) {
Throwable cause = e.getCause();
if (cause instanceof MLException) {
throw (MLException)cause;
throw (MLException) cause;
} else {
logger.error("Failed to init instance for type " + type, e);
return null;
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -5,10 +5,10 @@

package org.opensearch.ml.engine;

import org.opensearch.ml.common.MLModel;

import java.util.Map;

import org.opensearch.ml.common.MLModel;

public interface MLExecutable extends Executable {

/**
Expand Down
Loading

0 comments on commit 5be107b

Please sign in to comment.