LFX Mentorship Autumn 2024 Pretest - FuryMartin #139

FuryMartin · 2024-08-24T18:08:25Z

What type of PR is this?

/kind design

What this PR does / why we need it:

This is a demo for #130. I have implemented a cloud-edge collaborative strategy named query-routing.

Query-routing sends each user query to either the cloud model or the edge model, based on the query's difficulty coefficient.

For task 1, I used a modified Sedna package to support jsonl data evaluation.
For task 2, I implemented query-routing based on Ianvs.
For task 3, the results are in README.md. The design details can be found in cloud-edge-collaboration-inference-for-llm.
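For readers who want a concrete picture of the routing idea, here is a minimal sketch. It is not the code in this PR and does not use the Ianvs or Sedna APIs: `QueryRouter`, `RoutedResponse`, `edge_rate`, the word-count difficulty estimator, and the 0.5 threshold are all hypothetical names and values chosen for illustration. It also assumes that the edge-rate metric means the fraction of queries answered by the edge model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RoutedResponse:
    query: str
    answer: str
    served_by: str  # "edge" or "cloud"


class QueryRouter:
    """Hypothetical router: easy queries go to the edge model, hard ones to the cloud."""

    def __init__(
        self,
        estimate_difficulty: Callable[[str], float],
        edge_model: Callable[[str], str],
        cloud_model: Callable[[str], str],
        threshold: float = 0.5,  # assumed cut-off, not taken from the PR
    ) -> None:
        self.estimate_difficulty = estimate_difficulty
        self.edge_model = edge_model
        self.cloud_model = cloud_model
        self.threshold = threshold

    def route(self, query: str) -> RoutedResponse:
        # Score the query, then pick the model on one side of the threshold.
        score = self.estimate_difficulty(query)
        if score < self.threshold:
            return RoutedResponse(query, self.edge_model(query), "edge")
        return RoutedResponse(query, self.cloud_model(query), "cloud")


def edge_rate(responses: List[RoutedResponse]) -> float:
    """Fraction of queries served by the edge model (assumed definition)."""
    if not responses:
        return 0.0
    return sum(r.served_by == "edge" for r in responses) / len(responses)


def word_count_difficulty(query: str) -> float:
    # Toy stand-in for a real difficulty estimator: longer queries count as harder.
    return min(len(query.split()) / 20.0, 1.0)


if __name__ == "__main__":
    router = QueryRouter(
        estimate_difficulty=word_count_difficulty,
        edge_model=lambda q: f"[edge] {q}",
        cloud_model=lambda q: f"[cloud] {q}",
    )
    queries = [
        "What is 2 + 2?",
        "Explain step by step how a cloud-edge collaborative inference "
        "system decides where each query should be answered and why.",
    ]
    results = [router.route(q) for q in queries]
    for r in results:
        print(r.served_by, "<-", r.query)
    print("edge rate:", edge_rate(results))
```

In a real setup the difficulty estimator would itself be a model or classifier, and the threshold trades answer quality against cloud cost: raising it keeps more queries on the edge.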

Which issue(s) this PR fixes:

Fixes #130

Signed-off-by: Yu Fan <[email protected]>

kubeedge-bot · 2024-08-24T18:08:31Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To complete the pull request process, please assign jaypume after the PR has been reviewed.
You can assign the PR to them by writing /assign @jaypume in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

FuryMartin · 2024-08-24T18:10:40Z

This PR should be reviewed by @hsj576.

hsj576

The overall work is good, basically completed the pretest requirements.

FuryMartin · 2024-08-30T08:25:56Z

> The overall work is good, basically completed the pretest requirements.

Thanks!

Considering that this PR is just a demonstration, should I close it?

FuryMartin and others added 10 commits July 18, 2024 10:18

add proposal: cloud-edge-collaboration-inference-for-llm

a340bba

Signed-off-by: Yu Fan <[email protected]>

Update proposal: cloud-edge-collaboration-inference-for-llm

167895b

Signed-off-by: Yu Fan <[email protected]>

Update proposal: cloud-edge-collaboration

863c812

Signed-off-by: Yu Fan <[email protected]>

Update proposal: cloud-edge-collaboration

b3e7673

Signed-off-by: Yu Fan <[email protected]>

add: introduce Joint Inference as a new paradigm

4ac560b

Signed-off-by: Yu Fan <[email protected]>

add: example for LLM joint inference

6824017

Signed-off-by: Yu Fan <[email protected]>

add: support jsonl dataset

6f25da5

Signed-off-by: Yu Fan <[email protected]>

fix: import error and interface alignment

a77c949

Signed-off-by: Yu Fan <[email protected]>

add: edge-rate metric for joint inference example

e3531b7

Signed-off-by: Yu Fan <[email protected]>

add: README.md for cloud-edge joint inference example

0efb86a

Signed-off-by: Yu Fan <[email protected]>

kubeedge-bot added the kind/design Categorizes issue or PR as related to design. label Aug 24, 2024

kubeedge-bot requested review from jaypume and Poorunga August 24, 2024 18:08

kubeedge-bot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Aug 24, 2024

MooreZheng assigned hsj576 Aug 29, 2024

hsj576 reviewed Aug 30, 2024


FuryMartin closed this Sep 6, 2024
