Skip to content

Latest commit

 

History

History
799 lines (646 loc) · 25.4 KB

README.md

File metadata and controls

799 lines (646 loc) · 25.4 KB

Class_notes_BME547

I will use this repos to take down notes while learning and practice the skill I learned meanwhile.

Lec Jan 18__Intro to git

  • Mac change prompt main

  • git commit -m “purpose”

  • Commit steps:

    1. nano file
    2. git status
    3. git add file
    4. git status
    5. git commit -m “purpose”
  • Open .

  • Two space at the end to start a new line

  • ls -a

  • git log --oneline

  • To cover the content on github, follow the steps below:

    1. git add .
    2. git commit -m""
    3. git pull
    4. git push
  • Use different signs to start different samples: *,-,+ and with "\t"

  • git commit [forget to have purpose, type "I" to switch to insert mode, then "esc" to ":", type "wq" to quit] [refer to "escaping VIM editor when git commit"

  • markdown formatting "# Title", "## Subtitle"

  • compare versions of commits:

    1. git log --oneline
    2. git diff xxx..xxx [deleted contents shown in red, added changes shown in green?]
    3. use arrows to display and "esc" to quit
  • git diff, git diff xxx..xxx, git diff --staged

  • Can also made changes directly on github, remember to change commit description

  • rm -rf file

  • rm file

  • command+T [new window to continue]

branch

  • git branch
  • git branch test_info [create new branch]
  • git checkout test_info [change branch]
  • Use "git add" to single file and use "git commit" to multiple files [sometime fixing same problem]
  • Recommend merging branches on github [to test and get permission]
    • git push --set-upstream..
    • on github, choose "pull requests", choose branches and then merge

Lec Jan 20__Intro to Python

  • Function:

    • take input
    • do the work
    • return the output
  • use "Tab" to automatically fill the filename

    • print("The answer is {}".format(x)) [give variable]
    • print("The HDL result of {} is connected {}".format(HDL_value, HDL_analy))
    • print("The HDL result of {:.1f} is connected {}".format(HDL_value, HDL_analy)) [display a single decimal place]
    • round(weight,1) [give rounded number with specified number of decimals]
  • functions can have multiple returns with different types of variables [only Python can do that, no need to specify whether Boolean or Integer] [image01] [image02] [image03]

  • input("prompt") [get input from keyboard][this will give a string][need to transfer to integer--int()] [image04]

  • To keep updated, when doing projects, always remeber to checkout to main branch and then git pull

  • in-function variables are local so same name of variables can be used in different functions

  • "if, elif, else" in Python [image05]

  • "while" in Python [image06]

  • seperate the function name and variable name

Lec Jan 25__Modules and Virtual Environments

  • import files:

    1. import file [give access to all functions included] filename.function name
    2. import ...[filename] as ..[shortcut]
    3. from filename import function_name [import single function] when use the function, just use its name [image08]
  • special variable with two "_" in front and back of it eg: format(name)

with main module -- gives the main with other module -- gives the actual name [image07]

  • comment sign "#"

  • if name == "main": [!!] [use this to later input anything, always shows at the bottom of the file]

  • pycache file

  • touch .gitignore [!!] [usually do add this file in your repository][ignore everthing from there, include filename here]

  • .gitignore always include:

    • __pycache__
    • .ipynb_checkpoints
    • myvenv [not add virtual env to your repository]
  • Create virtual env: 0. create a new branch to setup venv

    1. python -m venv myvenv [on Mac][create virtual environment]
    2. source virtual_env_name/bin/activate [activate the env]
    3. nano requirements.txt [to capture a file with packages I need]
      • usually include:
        • jupyter
        • pytest [this two are for automatic testing]
        • pytest-pycodestyle
    4. touch .gitignore [.github/.DS_Store, venv_name]
    5. pip install -r requirements.txt
    6. pip list [show what package you download]
    7. git add .gitignore
    8. git commit
    9. git push branch
    10. deactivate
  • pip install package_name [easy way but not recommended]

  • install package:[!!]

    1. nano requirements.txt [to capture a file with packages I need]
    2. pip install -r requirements.txt
    3. pip list [show what package you download]
  • Jupyter notebook:

    1. "shift+enter" [run jupyter notebook]
    2. "ctrl+enter" [run but don‘t create new line]
    3. cell-> cell type -> markdown
    4. "ctrl+C" to exit
  • create a new jupyter notebook:

    • in terminal-type "jupyter notebook"
    • new-python3
  • sometime we don‘t push branches, we just commit them and leave them, then swith to main branch

  • virtual env does nothing to git, they are seperate

  • Markdown format
    https://github.com/adam-p/markdown-here/wiki/Markdown-Cheatsheet
    text bold
    emphasized text
    $\LaTeX$
    $y = x^2$
    a single back-tick(`) [give punctuation]
    if italic text

  • %whos 【to inspect the types of all active variables】

  • Python functions:

    • type() [see the type] [image09]

    [image10]

    • tuples and arrays
      • the first index of a list (or tuple or any array) is 0 in Python; MATLAB starts indexing at 1
      • access items of a tuple using []; MATLAB uses ().
      • In Python, () is used to call functions and methods.
  • commont terms: parenthese [ () ]

  • nest tuples within tuples... and can mix datatypes [image11]

    • tuples are immutable
  • lists

    • Lists are mutable.
    • tuples are created and printed using parentheses (example: my_tuple = (1, 2, 3));
    • lists are created and printed using square brackets (example: my_list = [1, 2, 3]).
    • both use [ ] to access the contents
    • list functions:
      • .append(), .pop(), .count(), .reverse() and .sort()
      • lists are mutable [image12]
    • List Comprehensions:
      • Python uses lists instead of loops:
        [x**2 for x in (1, 2, 3)]  # Output: [1, 4, 9]
        
      • x*2
      • x**2 [x^2]
  • for loops

    • Python can directly iterate on the content of a tuple,
    • unlike MATLAB, which likes to iterate on a specified range of numbers [image13]
    • conditionals in a list comprehension image[image15]
    • two different ways for iterating lists
      • iterating one list
    for i, patient in enumerate(db)
    print("Patient {} is in room {}".format(patient[0], room_numbers[i]))
    
    
    • iterating both lists
    for patient, rn in zip(db, room_numbers)
    print("Patient {} is in room {}".format(patient[0], rn))
    

    db, room_numbers should be in same size

  • enumerate()

    • carry along a numerical index that corresponds to which iteration you are on in the loop
    • first number is index, second one is the element [image14]

Lec Jan 27__Merge conflicts, Modulartiy and Tags

  • reasons to write modular codes:

    • organization(readibilty)
    • testing
    • reusability
  • tags: [start from that point]

    • git tag -a tag_name -m" " [usually gives after the merge pull action] [usually tag on main branch][push tags and push branches are seperate processes]
    • git tag -a tag_name commit_id -m" "
    • git checkout v1.0
    • git tag [list all your tags]
    • git show v1.0
    • git push origin v1.0
    • git push --tags

Lec Feb 1__Lists and loops

  • is and ==
if found_patient is False:

here we only use is when there is a boolean variable

  • mutable varibles (like lists, dictionary and classes) can be changed without return in a function, it doesn't work well with imutable variables (like strings, numbers)

  • do add a return at the end of the function, it is visible for you showing the end of function, you can just use return sentence

Lec Feb 3__Unit testing

  • pytest convention

    • file name for testing: test_+file_name [always name like this thus pytest will automatically run the file with this name]
    • function name for testing: test+function_name
  • 3As for testing:

    • Arrange [~ data for testing]
    • Act [call function]
    • Assert [~ to check if the function is correct]
      assert a boolean/condition sentence 
      
      [continue ruuning if True, throw error if False, thus pytest will use the error to know the function fails] eg: assert answer == "Normal" eg:
    def test_HDL_analysis():
    from blood_calculator import HDL_analysis
    # Arrange
    HDL_input = 65
    # Act
    answer = HDL_analysis(HDL_input)
    # Assert
    assert answer == "Normal" 
    
  • pytest or pytest -v[give more details of testing] pytest + filename [only run specific file]

  • function decorator: [gives addtional function alias] import module_name
    @module_name.mark.parametrize()

import pytest 

@pytest.mark.parametrize("HDL_input, expected", [
(65, "Normal"),
(40, "Borderline Low"),
(20, "Low")
])
def test_HDL_analysis(HDL_input, expected):
    from blood_calculator import HDL_analysis
    # Arrange
    # HDL_input = 65
    # Act
    answer = HDL_analysis(HDL_input)
    # Assert
    assert answer == expected 
  • unit testing:

    • we don't need to test every input, we just test every branch
    • if two test funtinos have the same name, the test will only output the last one
    • always write unit test before writing the function itself
    • write test and function itself on the same branch
  • pytest -v --pycodestyle

    • PEP8 convention

Lec Feb 8__Unit testing

  • test: True, False, border eg: if assert age >=18, test(15, 18, 20)

  • test

import math
def area_of_ellipse(x: list):
    data = x.split(",")
    if len(data) = 1:
        area = math.pi * data[0]**2
    elif len(data) = 2:
        area = math.pi * data[0] * data[1]
    elif len(data) = 4:
        a = (data[2] - data[0]) / 2
        b = (data[3] - data[1]) / 2
        area = math.pi * a * b
    else:
        area = None
    return area

test: [ ([1.5], area) ([1.5, 2.5], area) ([1, 10, 20, 30], area) ([1, 2, 3], None) ([], None) ]

  • plural [复数, eg: apples]

  • robust test example:

      def parse_weight_input(weight_input):
          if weight_input == "":
              return None
          if weight_input.find(" ") < 0:
              return None
          weight, units = weight_input.split(' ')
          weight = float(weight)
          units = units.lower()
          if units[-1] == "s":
              units = units[0:-1]
          if units in ["lb", "pound"]:
              weight_kg = convert_lb_to_kg(weight)
          elif units == "kg":
              weight_kg = weight
          else:
              return None
          weight_kg = round(weight_kg)
          return weight_kg
    
    • weight_input.find(" "):

      • return -1 if there is no string
      • return the location if find the string
    • units = lbs

      • units[-1] = s
      • units[0:-1] = lb
  • when testing a function that will call another function, test the two different functions seperately

  • pytest.approx(0.3, abs=0.1) [0.3 +- 0.1] [ Decimals are stored in binary, doesn't exactly equal to 0.3 ] [ abs changes the absolute error] eg: import pytest def test_add(): from add import add answer = add(0.1, 0.2) assert answer == pytest.approx(0.3)

  • always create the .github folder when start a new repository, but after create venv

Lec Feb 10__Dictionaries and Classes

  • <= 29.9 and <30 [for the first case, 29.95 can not be captured, so be specific about the decimal comparison]

  • dictionary:

    • more readable and understandable to use key rather than index
    • as the dictionary gets larger, the search time won't increase [just give result based on the key][no meaning to iterate a larger number of patients, just search]
  • use None instead of making error

    • use None:
    patient = db.get(mrn_to_find)
    if patient is None:
            return False
    return patient
    
    • will get error
    patient = db[mrn_to_find]
    for patient in db:
        if patient["MRN"] == mrn_to _find:
            return patient
    return False
    
    • db.values()
    for key in db:
        print("MRN: {}, Full Name: {}, Age: {}".format(db[key][patient]["MRN"],
                                                       get_full_name(patient),
                                                       db[key][patient]["Age"]))
    

    [return error since patient are keys]

    for patient in db.values():
        print("MRN: {}, Full Name: {}, Age: {}".format(patient["MRN"],
                                                       get_full_name(patient),
                                                       patient["Age"]))
    

    [patient now is dictionary]

    • object-oriented programming:

      • use of class:
        • put all the data and methods to a capsule
      • example:
      class Patient:
      
      def __init__(self):
      self.first_name = ""
      self.last_name = ""
      self.mrn = 0
      self.age = 0
      self.tests = []
      
      def get_full_name(self):
      full_name = "{} {}".format(self.first_name,
                                 self.last_name)
      return full_name
      
      
      def main():
      new_patient = Patient()
      print(new_patient)
      print(new_patient.get_full_name())
      
      
      if __name__ == "__main__":
      main()
      
      • class name capitaled in PEP8
      • new_patient = Patient() [create a new instance]
      • print shows the memory location
      • each class variable has its instance [instance's value can be changed but it is pointed to a certain location when new_patient is created.]
      • always put self at the frist place of the parameter list
      • python will ignore self when calling the function
      • when save sth to part of the class instance, use self
    • init() function ["__" in python means special name in python]

    • module examples: [i0]

  • json https://github.com/dward2/BME547/blob/main/Lectures/json.md

  • docstrings https://github.com/dward2/BME547/blob/main/Lectures/docstrings.md

  • jupyter notebook jupyter_practice.ipynb [run xx.ipynb file]

  • in_file = open("input_data.txt","r") https://github.com/dward2/BME547/blob/main/Lectures/file_input.md

    • first string: file_name [
      • if want to open a file in the main folder of your repository, just file_name
      • if in other folders, folder_name/file_name ]
  • second string:

    • "r" - read
    • "w" - write
    • "a" - append
  • read a file

    in_file = open("input_data.txt","r")
    fruits = in_file.readlines()
    print(fruits)
    in_file.close()
    
    • file_name.readlines() [read all file]
    • file_name.readline() [read one line]??? seperaete??
    • in_file = open()
    • in_file.close()
    in_file = open("input_data.txt", "r")
    for line in in_file:
        print(line, end="")
    
  • close file:

    • with open("input_file.txt", "r") as in_file: [the file automatically closed once with block finished]
    with open("input_file.txt", "r") as in_file:
    for line in in_file:
        print(line, end="")
    
    • in_file.close()
  • .strip("xx") [remove leading and trailing characters or sign]

    • \n, \t
  • test_results = [int(i) for i in tests] [i1]

  • print(line, end="")

  • two main errors in python: https://github.com/dward2/BME547/blob/main/Lectures/exceptions.md

    • SyntaxError [python can not translate the code][code didn't run]
    • Anything else: exception errors [can run but some mistakes exist] [i2]
    • try/except
    • finally clause
    • raise exceptions
    • test for raised exceptions
    with pytest.raises(ValueError):
        move_direction("L")
    

    [i3]

    • warning [i4]
    • sys.exit()

Lec Feb 17__logging/Numpy/Matplotlib

  • test case only have one single reason to fail

    • always arrage conditionals logically
  • python org --find functions pypi.org --find packages [python pacakage index] packages-modules[math]-functions[.sqrt]

  • logging: print result to a file, not to the console

  • logging package are default library in python, no need to include in venv but need to import

  • levels in the logging module; least important to most important: debug-info-warning-error-critical

  • change logging level: if name == 'main': logging.basicConfig(level = logging.INFO, filename = "signal.log", filemode = "w")

    • filemode: a(default), w
  • order:

    1. import logging
    2. write logging command: eg: logging.info("xxx")
    3. logging.basicConfig(level = logging.INFO, filename = "signal.log", filemode = "w")
    4. unit test for logging https://github.com/dward2/BME547/blob/main/Resources/testing_logging.md
      1. include testfixtures in venv
      2. write unit testing
        # test_my_functions.py
            from testfixtures import LogCapture
        
            def test_log_if_negative_is_made():
                from my_functions import log_if_negative
                with LogCapture() as log_c:
                    log_if_negative(-1)
                log_c.check(("root", "INFO", "a is a negative number"))
        
  • elif new_data not in "+-0" [not in]

  • file.read(1) [return 1 byte in the file, continue reading for the second command]

  • import numpy as np

    • list in python vs array in numpy list: with "," array: without ","
    • np.zeros(10, dtype=int) [default type is float] np.ones(10)
    • np.arange(start:end:interval) eg:[(0,10,0.1)] [default interval is 1]
    • np.sin(t)
  • import matplotlib.pyplot as plt

    • plt.plot(t,y1)
    • plt.plot(t,y2) [plots are shown in one window]
    • plt.show()
    • plt.legendtouc
  • data = np.loadtxt(filename, delimiter=",", dtype=int) load each row as an element

  • and or conditionals

  • When starting a new repository, this is what I would do first. I would first make a branch called “repo_setup”. On this branch, I would create, add, and commit the following files to the repository:

    1. The “requirements.txt” file that contains at least “pytest” and “pytest-pycodestyle”. ("Jupyter","Numpy","matplotlib")
      • touch .gitignore [.github/.DS_Store, venv_name]
    2. Create the “.github/workflows/pytest_runner.yml” file.
    3. A python script file (example: cell_assay.py) that contains the single command “pass”. [new line at the end of the file] (The reason this file is included is because pytest will fail and cause GitHub Actions to fail if pytest cannot find any python files to analyze.)

    The presence of these three files should allow GitHub Actions to run and successfully pass. Once you see that GitHub Actions passes, merge this branch into the main branch. Then, all of these files will exist on all new branches in the future and GitHub Actions will run. You can then create a new branch and start adding your code to the python script that you created. 4. Return to the main folder of the repository and git add, commit, push

    1. Create venv 0. create a new branch to setup venv
      1. python -m venv myvenv [on Mac][create virtual environment]
      2. source virtual_env_name/bin/activate [activate the env]
      3. pip install -r requirements.txt
      4. git commit
      5. git push branch
      6. deactivate
  • codes to keep the program running:

    while True: [keep the program running] result = read_person_info(first_line, in_file) first_line = result if result is False: break [break the loop]

Lec Feb 22__Functional Decomposition

  • write flow chart:

    • decompose functions [F271][F272]
  • assignment6

    • breath time is not important, just make it consistent
    • license: 40:52 in video [a software license with your project (http://choosealicense.com/)]

Lec Feb 24__IDEs/Dubugging

  • dubbugger
  • VS code:
  • clone a repo
  • open an existing repo
  • select Python interpreter
  • create venv
  • create new branch
  • step into/over/out
  • commit
  • debugger:
    • how to find error:
      • debug the file without breakpoint
      • get the lightning sign
      • set a breakpoint at the lightning sign
      • set the condition
      • resume the program
  • debug:
  • set breakpoint
  • run debugger

Lec March 1__Requests and Quick Flask

  • The client: A client is anyone (in our case, a computer program on our local computer) that wants to interact or share information with a web server or service.
  • Web Server or Service: A web server or service is a program running on a separate machine (another computer, a cloud computer, or a computer cluster).
  • API: An Application Programming Interface (API) provides instructions and/or an interface for accessing the functionality of a program or package.
  • RESTful APIs: APIs in the cloud and are how we interact with web servers. [Making requests on the web] [Instead of installing and calling packages on our local computer, RESTful APIs allow us to call functionality in the cloud (on a server).]
  • Requests: A client makes a "request" of the server in order to interact.
  • Response: The server sends a "response" back to the client after a request. This response is often encoded as a JSON string. JSON strings are a standard method for information interchange over the internet.
  • URL: uniform resource locator [Calls to web services are done through URLs][URLs can be thought of as a function name to access a particular functionality of a server.]
  • endpoint
  • two main types of requests:
  • print(r) [ex: Response [200]]
  • r.status_code [ex: 200][response code][tells you whether the request is good or not][ex: 404 tells you doesn't exist][200: status for good request] --[Always print that to make sure everything is good]
  • r.text [tells us what comes back from the server]
  • branches = r.json() [decode the JSON text string into a Python native variable type]__[always debug the file here to see what will be printed out]

GitHub Copilot;

  • Pycharm: See suggestions on a new tab: press Command+Shift+A, then click Open GitHub Copilot, or press Command+Shift+\ to open the new tab immediately
  • describe something you want to do using natural language within a comment
  • VS code: To open a new tab with multiple additional options, press Ctrl+Enter.

Lec March 3__

  • app = Flask(name)

127.0.0.1 - - [16/Mar/2023 20:23:01] "GET / HTTP/1.1" 200 - [record of getting request from the URL] 127.0.0.1 - - [16/Mar/2023 20:23:02] "GET /favicon.ico HTTP/1.1" 404 - [the error tells us we didn't set up an icon for the website]

  • pycharm suggestions: main->if name == "main"
  • @app.route("/", methods=["GET"]) [decorator]
  • basic code for writing a server:
  1. define a variable that access the flask
  2. run it
  3. define what you want for each route to look like using decorator
from flask import Flask

app = Flask(__name__)


@app.route("/", methods=["GET"])
def server_status():
    return "Server On"


if __name__ == '__main__':
    app.run()
  • always stop previous server code after make changes to your server code
  • servers are event-driven systems: don‘t know what order you will go into
  • Flask Handler: functions decorated with the @app.route [it handles the request that comes to that route]

    1. get input
    2. call other functions
    3. return a response
  • run the server, then run the client

  • np array need coma between elements, even if it is not shown on screen

  • always use jsonify [even you know it's string or list, it‘s easier]

  • the internet can only access the string, so whether put or get should be a string: server part: jsonify(answer) client part: r.json() [will tranfer r from a json string to a python variable] ex: @app.route("/add_two/<a>/<b>", methods=["GET"])[a, b are strings]

  • use status code to interact with the server to decide if the request is successful or not ex: return "The answer was less than zero. BAD", 400 if r.status_code == 200: x = r.json() print(x + 2)

Lec March 10__debug server

run server -> run client

assignment:

marking fakes

integrate the unit tests

Lec March 22__Virtual Machines, Screen, Intro to MongoDB/PyMODM

  • step to log in vm:

Lec March 24__Examples of MongoDB

  • if want to change the db:
    1. search first
    2. then change

Lec March 29__GUI

  • GUI: event-driven
  • run part code -> setup, then wait for event and give response
  • event include: type in text, move the mouse, etc
  • GUI Design:
    • make actions cancelable or reversible
    • provide frequent feedback
    • create cues about what should be done next:
      • Use checklists or label steps
      • Put the next action or input near to the previous action or input
    • Be consistent in how interface works
    • Avoid jargon
    • just a test for ssh sdsfd