⚠️ Challenge DF is now retired. Details. From now on, data scientists can compete in predicting ETH by running Ocean Predictoor bots.
This repo should be used for archival / reference purposes only.
This README provides instructions to participate in Challenge Data Farming (DF).
Prize Pool: 1,000 OCEAN (as of Nov 2, 2022)
- 1st place: 500 OCEAN
- 2nd place: 300 OCEAN
- 3rd place: 200 OCEAN
We will identify winners by the blockchain account they use in the competition (on Mumbai). Rewards get distributed as part of DF rewards: we allocate OCEAN to winners' accounts on Ethereum mainnet as part of Active Rewards contract.
Challenge DF is a recurrent weekly challenge, as part of DF.
- Submission deadline: Every Wednesday at 23:59 UTC when the Data Farming round finishes.
- Prediction at times: Every Thursday at 00:05 UTC, 00:10, ..., 1:00 (12 predictions total).
The winner = whoever has lowest prediction error. That's all. 📈
To be eligible, competitors must produce the outcomes that this README guides. This includes:
- Created an Ocean data NFT
- On the data NFT, set a value correctly: correct field label, correct # predictions, prediction values following correct formatting, predictions encrypted with proper encoding on judges' public key
- Transferred data NFT to Ocean judges before the submission deadline
- All on Mumbai network, not another network
If there is >1 submission by the same address, then the most recent one (that still meets the deadline) is used.
- Go to the DF Webapp's Active Rewards section.
- Click the "CLAIM REWARDS" button. This will claim all your Active DF rewards -- from Challenge DF and otherwise.
If you encounter issues, feel free to reach out ✋
This readme describes a basic flow to predict future ETH price, and submit your predictions to contest judges. We'll be using Mumbai, which is Polygon's testnet.
Here are the steps:
- Setup
- Get data locally
- Make predictions
- Publish & share predictions
In ocean.py's install.md, follow all steps.
The predict-eth library has a specific error calculation function, and other functions specific to this competition. In the console:
pip install predict-eth
The READMEs use several numerical & ML libraries. In the console:
pip install python-dateutil==2.8.1 ccxt eth_account matplotlib numpy pandas==1.5.3 prophet requests scikit-learn
In ocean.py's setup-remote.md, follow all steps.
Make sure you're in running in Mumbai!
In the Python console:
from predict_eth.helpers import *
Here, use whatever data you wish. It can be static data or streams, free or priced, raw data or feature vectors or otherwise. It can be published via Ocean, or not. The main README links to some options.
This demo flow skips getting data because it will generate random predictions (no data needed).
Here, build whatever AI/ML model you want, leveraging the data from the previous step. The main README links to some options.
This demo flow skips building a model because it will generate random predictions (no model needed).
Predictions must be one prediction every 5mins, for a 60min period. The specific times were given above. There are 12 predictions total. The output is a list with 12 items.
Here's an example with random numbers. In the same Python console:
#get predicted ETH values
mean, stddev = 1800, 25.0
pred_vals = list(np.random.normal(loc=mean, scale=stddev, size=(12,)))
We use normalized mean-squared error (NMSE) as the accuracy measure.
In the same Python console:
# get the time range we want to test for
start_dt = datetime.datetime.utcnow() - datetime.timedelta(minutes=120) #must be >= 60min ago; we use 120
start_dt = round_to_nearest_timeframe(start_dt) # so that times line up
target_uts = target_12_unixtimes(start_dt)
print_datetime_info("target times", target_uts)
# get the actual ETH values at that time
import ccxt
allcex_x = ccxt.binance().fetch_ohlcv('ETH/USDT', '5m')
allcex_uts = [xi[0]/1000 for xi in allcex_x]
allcex_vals = [xi[4] for xi in allcex_x]
print_datetime_info("allcex times", allcex_uts)
cex_vals = filter_to_target_uts(target_uts, allcex_uts, allcex_vals)
# now, we have predicted and actual values. Let's find error, and plot!
nmse = calc_nmse(cex_vals, pred_vals)
print(f"NMSE = {nmse}")
plot_prices(cex_vals, pred_vals)
Keep iterating in step 3 until you're satisfied with accuracy. Then...
We'll now publish the asset on Mumbai. As a result of completing the remote setup in Step 1.4, you should be ready to go.
In the same Python console:
# Imports
from ocean_lib.ocean import crypto
# Create data NFT
data_nft = ocean.data_nft_factory.create({"from": alice}, 'Data NFT 1', 'DN1')
print(f"Created data NFT with address={data_nft.address}")
# Encrypt predictions with judges' public key, so competitors can't see.
# NOTE: public key is *not* the same thing as address. Using address will not work.
judges_pubkey = '0x3d87bf8bde8c093a16ca5441b5a1053d34a28aca75dc4afffb7a2a513f2a16d2ac41bac68d8fc53058ed4846de25064098bbfaf0e1a5979aeb98028ce69fab6a'
pred_vals_str = str(pred_vals)
pred_vals_str_enc = crypto.asym_encrypt(pred_vals_str, judges_pubkey)
# Store predictions to data NFT, on-chain
data_nft.set_data("predictions", pred_vals_str_enc, {"from": alice})
# Transfer the data NFT to judges, for prediction tamper-resistance
judges_address = '0xA54ABd42b11B7C97538CAD7C6A2820419ddF703E'
token_id = 1
tx = data_nft.safeTransferFrom(alice.address, judges_address, token_id, {"from": alice})
# Ensure the transfer was successful
assert get_transfer_event(ocean, data_nft, tx).args.to.lower() == judges_address.lower()
# Print txid, as we'll use it in the next step
print(f"txid from transferring the nft: {tx.transactionHash.hex()}")
Section 0.3 "Criteria to win" has a checklist of things you need to have done. Ensure that you've done these. If you missed any, you will not be eligible.
And if that's good, then...
Congratulations! You've now made your submission to the challenge! 🎉
This section is optional for you, to get a flavor of how judging looks.
Judging happens automatically as part of weekly DF operations. Your nmse will be auto-computed and compared with nsme of others.
The actual judging code is at df-py repo in 'challenge' sub-directory. What follows is simply a guideline.
In the terminal:
export REMOTE_TEST_PRIVATE_KEY1=<judges' private key, having address 0xA54A..>
export RPC_URL=https://polygon.llamarpc.com # or the RPC of your choice
In the same Python console:
# setup
from ocean_lib.models.data_nft import DataNFT
from ocean_lib.ocean import crypto
from predict_eth.helpers import *
import os
ocean = create_ocean_instance(os.getenv("RPC_URL"))
alice = create_alice_wallet(ocean) # the judge is Alice
# specify target times
# start_dt = round_to_nearest_hour(datetime.datetime.now() - datetime.timedelta(hours=24)) # use this if you're following up from above
start_dt = datetime.datetime(2023, 6, 7, 23, 59) #Jun 7, 2023 at 23.59 UTC # judges use this
target_uts = target_12_unixtimes(start_dt)
print_datetime_info("target times", target_uts)
# get predicted ETH values
data_nft_addr = <addr of your data NFT. Judges will find this from the chain>
data_nft = DataNFT(ocean.config_dict, data_nft_addr)
pred_vals_str_enc = data_nft.get_data("predictions")
pred_vals_str = crypto.asym_decrypt(pred_vals_str_enc, alice._private_key.hex())
pred_vals = [float(s) for s in pred_vals_str[1:-1].split(',')]
# get actual ETH values (final)
import ccxt
cex_x = ccxt.binance().fetch_ohlcv('ETH/USDT', '5m')
allcex_uts = [xi[0]/1000 for xi in cex_x]
allcex_vals = [xi[4] for xi in cex_x]
print_datetime_info("all CEX data info", allcex_uts)
cex_vals = filter_to_target_uts(target_uts, allcex_uts, allcex_vals)
print(f"cex ETH price is ${cex_vals[0]} at start_dt of {start_dt}")
print(f"cex_vals: {cex_vals}")
# calc nmse, plot
nmse = calc_nmse(cex_vals, pred_vals)
print(f"NMSE = {nmse}")
plot_prices(cex_vals, pred_vals)