
Todo - AZT

A-Z+T 1.0 release stoppers

This is the A-Z+T feature set required for largely independent use, from the start through the alphabet chart to tone data collection.

  • Output alphabet chart with
    • one default set of characters to display (all consonant and vowel groups, which show up anywhere)
    • one default alphabet order (from Unicode/ANSI/ASCII?)
    • one default layout (maybe calculated from 8.5×11-ish paper, so it depends on how many letters to show)
    • a means to select words (pictures having already been selected)
  • Decide how to address sorting and reporting with obligatory morphology, implement and document for users.
  • French translation

A-Z+T 1.0 Bugs

  • Figure out why CS isn't working (not in settings?)
  • Segment interpretation settings saving and operating as expected!!
  • under the hood
    • self.verifybutton
    • removesenseidfromgroup
  • user interface
    • Consider removing group numbers from UI. Call it 'this group'?
  • Segment reports by sort groups (not current form)
  • figure out sorting by root position (even if the full implementation waits for v1.1)
  • figure out filtering by noun/verb class (even if the full implementation waits for v1.1)
  • decide what to do with ad hoc groups, either to maintain (and update), or drop altogether
  • fix 'add words' crashing when there are no words
  • fix the lack of an error on new language creation when the directory already exists

Version 1.1+

Tone Frame exemplification

  • put real lexical examples, in place of <x lang word>
  • put 'change word' button on the top.
    • will need to work in the case of asking for gloss languages not in the lexicon
      • just list in this case?
      • unselect least populous gloss? I don't like this
      • list upfront the number of glosses in each lang: do this in any case
        • maybe also the number of entries with this, that, and the other glosses.
  • finish each line with a colon (?), followed by example ?as currently formatted
    • Not sure how well this would work

New Reports

  • pssubclass report
  • alphabet report/chart

Data collection

  • consider collecting recordings first, then first transcription.

Papercuts

  • Picking up images for demo dbs doesn't seem to be working consistently

  • Read and write LIFT header attributes (creator, date?)

  • Don't die at startup if git user info isn't specified

  • New project needs to run git init, or else not die

  • .gitignore must be complete before the first commit, or it will take in unwanted files!

  • keep taskchooser from showing up on reports

  • test sound function thoroughly

  • Make change gloss task

  • make join/rename ps task

  • Handle capitalization well (where users input words with capitals)

  • Should I pull NC from digraphs, and rely on NC=C setting?

  • Look at using 'aa' as a digraph vs. using the VV=V setting: do they both give the same V groups? Which one is better in terms of syllable profiles? (They should be the same...)

  • Consider putting AZT repo URLs in the AZT folder, not the language data settings file (unlike the same settings for language data)

  • 'Please wait' has black bars on the left and bottom on Windows

  • Solicit languages (from users or the general populace) that need other scripts

  • Incorporate GSRfL stylesheets into AZT

    • Catch up XLP stylesheets to GSRfL
  • Try out different fonts and font variations in fonts.py, to see which ones work. See if we can run A-Z+T in just those fonts, and not have to switch between Charis and Andika.

Document background preparation to do

  • Look at parts of speech, and decide what are likely good frames
  • Either based on the family, related languages, or on what is known of the language itself.
  • Think through recording needs and equipment, including environment and training required.

Hardware

  • Can we facilitate the purchase of a lot for people who can front the money?
    • Pi with keyboard: check out current options for CPU, RAM
    • Projector: USB-C, ?with battery backup
    • Battery: large capacity, USB-C
    • <$200
    • Mouse

Transcriber

  • pause between syllables
  • There's a difference between words, makes shakes now pronounced
  • inappropriate tweak between adjacent letters of the same value (should be even tone)

Project organization and status

A table somewhere to report status at a higher level

  • ps v profile
  • some 'done' value for each of C, V, and tone
    • Sort: number of checks with
      • no tosort
      • all verified groups
      • no tojoin
    • Name: number of checks with
      • no integer groups
    • Record: number of checks with
      • recorded True?
  • if group names are listed, they should be in the same order
  • allow clicking to go there (could replace the task chooser; the 'done' roll-up is sketched below)
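
The roll-up described in the list above could be as simple as the following sketch. The check-dict shape (keys like tosort, tojoin, groups, recorded) is purely hypothetical and not A-Z+T's actual internal structure.

```python
# Purely illustrative roll-up of per-check status into 'done' counts.
# The check dict keys (tosort, tojoin, groups, recorded) are hypothetical.
def summarize(checks):
    """checks: iterable of dicts, one per ps/profile check."""
    checks = list(checks)
    return {
        "sort done": sum(1 for c in checks
                         if not c.get("tosort") and not c.get("tojoin")),
        "name done": sum(1 for c in checks
                         if not any(str(g).isdigit() for g in c.get("groups", []))),
        "recorded": sum(1 for c in checks if c.get("recorded") is True),
    }
```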

Consider a LIFT merge function in lift.py

  • Take a three-way merge between LIFT files, maybe with the origin.
  • For each guid, check everything not in a sense; for each sense, check its internals, except for example identifiers.
  • Wherever something has been added to one, add it to the outcome. Do the easy stuff first, then limit to the others (see the sketch below).
  • The stuff I was looking at today should be done automatically.
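
A minimal sketch of the "easy stuff" (whole entries added on either side relative to the origin), assuming stock LIFT structure with guid attributes on entry elements. Sense-level merging is not attempted here, and the file names are placeholders.

```python
# Sketch only: copy entries added in either branch (vs. origin) into the outcome.
# Assumes <entry guid="..."> elements directly under the LIFT root.
import xml.etree.ElementTree as ET

def entries_by_guid(tree):
    """Index a LIFT tree's <entry> elements by guid."""
    return {e.get("guid"): e for e in tree.getroot().findall("entry")}

def merge_added_entries(origin_file, ours_file, theirs_file, out_file):
    origin_guids = set(entries_by_guid(ET.parse(origin_file)))
    merged = ET.parse(ours_file)            # start the outcome from "ours"
    merged_guids = set(entries_by_guid(merged))

    # Anything in "theirs" that is in neither origin nor "ours" was added there.
    for guid, entry in entries_by_guid(ET.parse(theirs_file)).items():
        if guid not in origin_guids and guid not in merged_guids:
            merged.getroot().append(entry)

    merged.write(out_file, encoding="utf-8", xml_declaration=True)
```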

Alphabet chart example selection

  • Start with pictured words, then with S1=S2, etc.
  • Give a scrollbar of buttons, one for each letter
  • Letter buttons organized like group buttons, to scroll through them until you like one
  • Ultimately output to JSON or whatever the alphabet program uses, and also to a file for loading later to keep working, and for posterity (see the sketch below)
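
A minimal sketch of the "output to a file for loading later" idea. The JSON shape (letter mapped to a chosen word and picture) is an assumption, not the format of any particular alphabet program.

```python
# Sketch: save/reload alphabet chart selections; the structure is an assumption.
import json

def save_chart_selections(selections, filename="alphabetchart.json"):
    """selections: e.g. {'b': {'word': 'bala', 'image': 'bala.png'}}"""
    with open(filename, "w", encoding="utf-8") as f:
        json.dump(selections, f, ensure_ascii=False, indent=2)

def load_chart_selections(filename="alphabetchart.json"):
    with open(filename, encoding="utf-8") as f:
        return json.load(f)
```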

AZT 2.0 step 1

  • On clicking "sort", put up a brief set of instructions with a QR code, instructing everyone to go there on their phones.
  • Sorting then proceeds according to 2.0 rules until everyone is happy; then move on.
  • At first, should probably click for each slice of sorting
  • Could show on an instruction wall what is being sorted on, in case anyone gets lost or forgets the basic rules

Starting a database in a new related language

  • Add second language
  • Keep first language as a second analysis language, or as a gloss? (Have both as options?)
  • Once second analysis is added, can we just switch and continue?
  • Think about outputs, including comparison tables
  • Long term, do I want this as an option (to analyze multiple languages in one database)
  • Or should I rather work on tools to help comparison?
  • Dialect analysis considerations
    • Glosslang frames could not change, either in the frame or in a given example (without causing problems for the other analang, as there is just one value per glosslang)
    • so if name and glossing are not the same, make a different frame
    • This would only apply in a multilingual dictionary when multiple languages have forms in an entry, sharing one or more senses. In this case, glossing should be the same, though all the form fields (lx,lc,pl,imp) could have different values by language
      • if two languages do not share an entry, they won't share any senses (and therefore won't share any examples), and can have independent glossing.
    • Not sure if multiple senses just for sense variation between dialects is a good idea.
      • I don't know that there's a way to show which sense goes with which language.
        • could have one sense with forms in one language, and another sense with forms in the other.
          • this would require robust logic to not die on a sense missing a language form.
      • ?So if you need different definitions or glossing, you probably want different entries; bummer for comparative work.
    • Also, if we elicit one language through another, at least some of those entries will require (sooner or later) at least tweaks to the glossing/defns. So whatever UI does this, we need a way to split an entry by language when doing so, to preserve the original glossing for the other language.
      • 'which language do you want to change the gloss for?'
    • Set up frames for dialect analysis
      • We can use the same tone frames if they work in each dialect, or
        • A frame would have multiple analang forms, which would simply be ignored when not requested (as either analang or glosslang for the other)
      • simply define other frames; some would have some languages, and others would have others.
      • This would mean that frames would continue not to be coded for language, other than storing each frame form coded by analang code.
      • To do this, we would need a 'modify frame' page to add new analang data (glosses should stay unmodifiable)
        • This same page could change the frame name
    • I still need to think through how to do multiple analangs in reports

Installation issues

  • Why is numpy not installing for win_amd64?
  • Copy modules to AZT modules dir for windows users
  • Figure out how to add icon to windows shortcut
    • may be a naming character problem

Sort Orders

  • How to tell the computer, in the UI, which characters should precede others (see the sketch below)

    • Showing in order
    • Where to store that
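
One way to do this, sketched below: store the order as a plain list and sort with a key that ranks each character by its position in that list. The example alphabet and words are invented, and multigraphs would need tokenizing before ranking.

```python
# Sketch: sort words by a stored, user-defined alphabet order.
# Example order and words are invented; unknown characters sort last,
# and multigraphs (e.g. 'gb') would need tokenizing before ranking.
ALPHABET = ["a", "b", "ɓ", "d", "ɗ", "e", "ɛ", "f", "g"]

def sort_key(word, order=ALPHABET):
    rank = {c: i for i, c in enumerate(order)}
    return [rank.get(ch, len(order)) for ch in word]

print(sorted(["ɗebe", "baga", "ɛfa"], key=sort_key))
# ['baga', 'ɗebe', 'ɛfa']
```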

    Frame Editor

    • Change frame name?
    • Would need to iterate across lift, for all profiles
    • Do I want to allow modification of frame content?
    • Copy of existing frame? Error on same name....
    • Sourced by window listing frame names and forms
    • Name change in toneframe, status, and lift
    • N.B.: this will be powerful and to be done with care, as it will change the location field for entries in your database.
    • Maybe add a check against doing this when it would damage something?
    • Lots of frames; is 24 too many?
      • How to investigate these questions without multiplying so many frames?
      • How to show only relevant frames
    • Header buttons should hide the column or row, coming back with a "show all" button in the corner

Mostly done stuff

Remaining parse fn

  • Once the second form is given, if the parse is rejected, offer another parse with a shorter root.
    • preserve set of root hypotheses,
      • maybe order them, and
      • ask about them in order
      • or just exclude what was rejected, and evaluate for the best again.

UI issues

  • Parse page needs to ask about nouns and verbs at the same time
    • Have a page with two scrolling columns, for noun options and verb options.
      • For each, list options to select, built from known prefixes and suffixes for citation and plural or imperative forms (added and removed as appropriate).
    • Be ready for a word that fits both (two senses with zero derivation)
    • pl could be a stand-in for whatever secondary form shows nounhood, and imp for whatever secondary form shows verbhood
      • should document this usefully somehow
  • offer list of options to select
    • User selects the correct second form, and AZT does all the calculation behind the scenes, including populating lc, pl, and/or imp, modifying/creating lx, and setting the part of speech, then presenting the next word.
  • Will need to have an "other" button.
    • AZT also has a "none of these" button under each column, which brings up another window to just type in the secondary form.
    • NO:activates automatically if nothing parses off of citation form
    • gives a fill in option for second form (and ps?)
    • will need a repair strategy if a bad morpheme set gets saved, or a bad ps
      • maybe load each time, so values not present get lost (once fixed)
      • fn to rebuild affix database (no UI for this yet)
    • This second page will do the same manipulation of the LIFT file, but all this info also gets stored in the known-affixes file, for use in presenting options to the user for future words.
  • Do ALL parsing behind the scenes
  • If parsing after giving second form, don't offer selection of second form
  • Don't do this:
    • What letters don't change between the plural and singular forms? Buttons for range(1, len(x)) letters missing from the front and back, then all combos, right? Scroll this list, as there will be many options. When the user selects one, a second frame opens with that root surrounded by prefix and plural boxes, asking the user what other letters are needed to build the plural/imperative form. Also a toggle for which form it is we're building.

New settings (for morphology, sensitive to subcategories)

  • Store pairs of affixes from object

Functional Structure

  • Word Collection (lc)
  • Record (lc)
  • ToneFrames (lc)
  • Syllable Profile Analysis (lc)
    • SortV (lc)
    • SortC (lc)
    • SortT (lc)
      • RecordT (lc)
      • TranscribeT (lc)
  • Parsing functions
    • Either of
      • ParseB (if clickable second form)
      • Collect 2nd forms (pl/imp)
        • ParseA (on the basis of typed second form)
    • produces these outputs:
      • output: lx analysis
      • output: second form (clicked or typed)
      • output: part of speech
    • allows these functions:
      • Record second forms
      • Syllable Profile Analysis (lx/pl/imp)
        • SortV (lx/pl/imp)
        • SortC (lx/pl/imp)
        • ToneFrames (lx/pl/imp)
          • SortT (lx/pl/imp)
            • RecordT (lx/pl/imp)
            • TranscribeT (lx/pl/imp)

Parsing objects/classes

Sort&Segments

  • Parsing needs to happen before CV sorting, but the forms need to be kept in sync during sorting, or else the parsing will need to be redone
    • CV parse: replace segments in multiple forms; use rx.split(), re.sub(), etc.
    • CV: Look at string replacement methods, see if any work for number of occurrences
    • Use new and old values for replacement
      • use replace to take changes from one and put to the other?
      • Keep lx up to date with lc, pl, and imp
      • Try splitting the form on the old value and joining with the new value as the delimiter (new.join(t.split(sep=old)) works; see the sketch after this list)
  • Do something with stem type field
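
A minimal sketch of keeping the form fields in sync using the new.join(t.split(sep=old)) idiom noted above. The flat entry dict is a stand-in for however the LIFT entry is actually accessed.

```python
# Sketch: propagate a segment change across parallel form fields, using
# the new.join(t.split(sep=old)) idiom.  The entry dict is a stand-in.
def resegment(entry, old, new, fields=("lc", "pl", "imp", "lx")):
    """Replace every occurrence of `old` with `new` in each form field."""
    for field in fields:
        t = entry.get(field)
        if t:
            entry[field] = new.join(t.split(sep=old))
    return entry

entry = {"lc": "libala", "pl": "dibala", "lx": "bala"}
print(resegment(entry, old="b", new="ɓ"))
# {'lc': 'liɓala', 'pl': 'diɓala', 'lx': 'ɓala'}
```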

Affix collector (do often and cheaply)

  • find common and uncommon segments of lx/lc/pl/imp
    • on boot
    • on parse
  • Assume no reduplication
    • maybe offer root twice in succession, if fails at first?
  • collect affixes only from best data:
    • Ideal (level 4): lx is a subset of both the first and second forms
      • form.split(sep=root) (assuming a good parse) gives known affixes for each (see the sketch after this list)
      • known affixes are already paired for that (correct) ps
    • Set the parser auto level above the default of 4 to parse these manually (e.g., if all the information is complete and consistent, but nonetheless incorrect).
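
A minimal sketch of harvesting one (lc, second-form) affix pair from an "ideal" entry using form.split(sep=root). The field names and tuple shapes follow the notes above, but the entry dict itself is an assumption.

```python
# Sketch: harvest a (prefix, suffix) pair per form from one ideal entry,
# where lx appears exactly once in both lc and the second form.
def affixes_for(form, root):
    pieces = form.split(sep=root)
    if len(pieces) != 2:        # root absent, or occurs more than once
        return None
    return tuple(pieces)        # (prefix, suffix)

def collect_pair(entry):
    """Return ((lc_pfx, lc_sfx), (2nd_pfx, 2nd_sfx)) for an ideal entry, else None."""
    root = entry["lx"]
    second = entry.get("pl") or entry.get("imp")
    lc_affixes = affixes_for(entry["lc"], root)
    second_affixes = affixes_for(second, root) if second else None
    if lc_affixes and second_affixes:
        return (lc_affixes, second_affixes)
    return None

print(collect_pair({"lx": "bala", "lc": "libala", "pl": "dibala"}))
# (('li', ''), ('di', ''))
```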

Parser (different method of same class as above)

  • Ideal-1: but missing or other ps
  • Ideal-2: unpaired known affixes
  • Ideal-3: only one known affix
  • ideal-4: no known affixes, or no lcs (including missing second form to compare)
    • Don't do anything with this except by explicit user request; rather, suggest possibly correct forms for the user to pick from.
  • Only two forms:
    • No lc (i.e., just lx and second):
      • assume lx is a pronounceable lc, move it over, and do 'no lx' below.
    • No lx (i.e., parsing not already done):
      • Ideal-: lx missing, but longest common string for 1st-2nd forms
        • is >50% of forms
        • can we come up with a more sane test for parse sanity?
        • should we calculate affixes, but also have them stored in settings?
    • No second form (not collected, one way or another):
      • parse between lx and lc only
      • suggest 2nd form possibilities w/o pairings
      • or we could skip this
        • and/or mark for parsing later

Affix storage catalog

  • By ps (nouns v verbs)
    • Make subcategories used in the ps profile logic, including an 'all' option
      • start hypersplit, get joined later (from ps-profile)
    • profile not needed, at least at first (maybe V init/final would provide different options than C init/final, but we would need to be careful until we know that was working)
    • list found tuples of tuples:
      • each noun combination is (lc, pl)
      • each verb combination is (lc, imp)
      • tuple for lc/pl/imp is (prefix, suffix)
        • [0] is pfx, [1] is suffix:
          • if len()>2, error msg
          • output this tuple to a tuple as above to preserve correspondences
          • should expect x:y relationship between lc affixes and second form affixes (so a simple list or dict wouldn't capture it)
      • e.g., one noun entry might result in (('li',),('di',))
      • from this list could derive list of affix tuples for
        • any first form [i[0] for i in list]
        • any second form [i[1] for i in list]
        • all three lists could be compiled with collections.Counter([iterable-or-mapping]) (see the sketch after this list)
          • most popular would be .most_common(1)
      • This means we don't have to track correspondence between lc affixes and pl/imp affixes in the settings for users to fix/screw up
        • track in LIFT pssubclass
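
A minimal sketch of that catalogue and the Counter summaries mentioned above; the stored pairs are invented example data.

```python
# Sketch: catalogue of ((lc_pfx, lc_sfx), (2nd_pfx, 2nd_sfx)) pairs per ps,
# summarized with collections.Counter; the data below is invented.
from collections import Counter

noun_pairs = [
    (("li", ""), ("di", "")),
    (("li", ""), ("di", "")),
    (("mu", ""), ("ba", "")),
]

lc_affixes = Counter(pair[0] for pair in noun_pairs)      # any first form
second_affixes = Counter(pair[1] for pair in noun_pairs)  # any second form
pairings = Counter(noun_pairs)                            # keeps correspondences

print(lc_affixes.most_common(1))   # [(('li', ''), 2)]
print(pairings.most_common(1))     # [((('li', ''), ('di', '')), 2)]
```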

Parser: access and store affix values in each LIFT sense

  • Each sense will already be marked for ps, read that
  • NO:Field[@type=' affixes']/form[@lang=analang]
  • in 'trait[@name="{}-infl-class"]'.format(self.psvalue())
  • Values are marked for placement by -x/x-, so the parser can pick them up and use them correctly in (pfx, sfx) tuples.
  • Value will be picked up by affixes object
  • Value stored by parser object (via affix object method?)
    • in LIFT
    • in affix catalog

Method to draft root

  • Import method first (build confirmed affixes)
  • Known affix method
    • This should cover use cases where lidata and didata would otherwise parse off just l-/d-, since we would start with known affixes, which could be set just once.
  • lcs method last (because it requires user confirmation, which will be required for all new affixes)
    • find the largest overlaps between two forms (difflib.SequenceMatcher(None, t, t2).find_longest_match(); see the sketch after this list)
      • This will collect common affix segments to the root
      • subset this to find all (analyzable) root possibilities
    • try to build actual forms with known affixes on these root hypotheses
  • Evaluate roots (not sure about this)
    • Filter prefixes by form.startswith(afx), suffixes w endswith(afx)
    • No:afx can be string or tuple of strings (maybe test that first?)
    • With two forms, do each, and if any from the one matches one from the other, assume that's your root. Mark how many, and track quality of match:
      • each form built on known affixes
      • known affixes for each form already present together
      • ps match on second form
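
A minimal sketch of the lcs step above, using difflib and the >50% sanity test. Note that shared prefix material ends up in the root guess, which is the "segments common to two affixes" problem noted further down; the example forms are invented.

```python
# Sketch: draft a root as the longest common substring of two forms,
# then apply the crude >50% sanity test.  Example forms are invented.
from difflib import SequenceMatcher

def draft_root(t, t2):
    m = SequenceMatcher(None, t, t2).find_longest_match()   # Python 3.9+
    return t[m.a:m.a + m.size]

def plausible(root, *forms, min_share=0.5):
    return all(len(root) / len(f) > min_share for f in forms)

root = draft_root("libala", "dibala")
print(root, plausible(root, "libala", "dibala"))
# ibala True (the shared 'i' of li-/di- gets pulled into the root)
```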

Parser: parse from two forms

  • This will not change the data fields. It only populates/changes the analysis in the lx field and the affix fields, so mistakes here should always be recoverable.
  • Need to parse lx from lc and (pl or imp).
  • Populate affix fields (by calling or running before affix collector)
  • if lx (or lc) and second form, but no affix defns:
    • If no lc, move lx to lc. This assumes lx was data, not analysis
    • draft root

Procedure for one form only:

  • load affix collector
  • Make list of root hypotheses: form less affixes for that form (lc or second)
    • process with affix tuple correspondences, to put on the second form
    • ?:I don't expect to have second forms where no lc exists, but should probably plan for this to happen eventually
  • For each (assuming the list is nonempty)
    • For each other form (from above)
      • suggest the root hypothesis plus the other form's affixes
  • present a "None of the Above" button
  • Take user input, and store it.
  • lx should be a subset of lc, and maybe pl and imp

Method to build forms (check draft root against (two) forms)

  • Iterate over each lc known affix
    • Check if root + affix == lc
  • Iterate over each known pl/imp affix
    • Check if root + affix == form
  • If the drafted root builds lc & 2nd with known affixes: 100%, +2 certainty (see the sketch after this list)
  • evaluate and act on lesser certainty
    • new affixes
    • segments common to two affixes: this is hard to show without a third form, or at least knowledge of the language family
      • this should already be dealt with?
      • If two different roots build forms on known affixes, the smaller root is taken.
        • prioritize known larger affixes over larger roots.
    • suppletive root
    • At some point we will need a user (who?) to confirm parsing, wherever a certainty threshold isn't reached:
      • a new affix is found
      • once second form is there, present parsing for confirmation (according to settings for confirmation and auto-parsing)
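
A minimal sketch of the check above: rebuild each form from the drafted root with known (prefix, suffix) pairs and score the result. The affix lists and the two-point score are assumptions.

```python
# Sketch: check a drafted root against the two forms with known
# (prefix, suffix) pairs; the affix lists and scoring are assumptions.
def builds(form, root, affix_pairs):
    """Return the (prefix, suffix) pair that rebuilds `form` from `root`, if any."""
    for prefix, suffix in affix_pairs:
        if form == prefix + root + suffix:
            return (prefix, suffix)
    return None

def certainty(root, lc, second, lc_affixes, second_affixes):
    score = 0
    if builds(lc, root, lc_affixes):
        score += 1
    if builds(second, root, second_affixes):
        score += 1
    return score    # 2 = both forms rebuilt from known affixes

lc_affixes = [("li", ""), ("mu", "")]
pl_affixes = [("di", ""), ("ba", "")]
print(certainty("bala", "libala", "dibala", lc_affixes, pl_affixes))  # 2
```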

~~## Parse I (should be part of parse UI – one option)

  • copy lc > pl/imp
  • this assumes NO obligatory morphology
  • this allows NO check for ps~~

~~## Parse II (Questionable value)

  • select 2 cuts
  • parse lc with gui buttons
  • maybe suggest pl/imp affixes on each?~~

Parse III

  • only works where lc has known affixes
  • GUI still requires "other" button
  • Access stored pairs of affixes
  • method to see which are possibly present in lc
  • construct possible alternate (pl/imp) forms for presentation to the user (see the sketch below)
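
A minimal sketch of Parse III: strip known lc affixes off the citation form and build candidate second forms from the paired affixes, for the user to pick from. The stored pairs are invented example data.

```python
# Sketch: build candidate pl/imp forms from stored (lc, second) affix pairs.
# The pairs below are invented example data.
noun_pairs = [(("li", ""), ("di", "")), (("mu", ""), ("ba", ""))]

def candidate_second_forms(lc, pairs):
    candidates = []
    for (pfx, sfx), (pfx2, sfx2) in pairs:
        if lc.startswith(pfx) and lc.endswith(sfx):
            root = lc[len(pfx):]
            if sfx:
                root = root[:-len(sfx)]
            candidates.append(pfx2 + root + sfx2)
    return candidates

print(candidate_second_forms("libala", noun_pairs))   # ['dibala']
```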

A-Z+T 1.1

General efficiency issues

  • Remove all build fns, replace with try blocks

Make lift class more OO

  • Method to return senses, etc as objects, use those internally and elsewhere
  • Find and modify object nodes, rather than getting and putting text
    • How to manage the question of when you make the empty node?
      • if the logic assumes a node object to write to, a missing object could break it.
      • create object on populate (extra try block)
      • If it doesn't get filled with info, are we just creating lots of empty nodes?
      • we could maybe have an attribute to mark modified, remove those without?
        • when would this be? not on all lift writes, as they are frequent...
  • How to superclass XML nodes?
  • Use classes for entry, sense, example, etc. (see the sketch after this list)
  • Distinguish examples by language form@lang/text
    • (Location and?) tone form fields should be coded by analang, so multiple languages can store distinguishable data in the same example (is this a good idea?)
    • use annotationlang
    • Think through whether we want a frame and/or example with a different name for each language... Probably not
    • Forms to search should be lexemes, so we get root C and V positioning
      • We really should optimize this better, if it is to be run often...
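
A minimal sketch of wrapping LIFT nodes in thin entry/sense classes, assuming stock LIFT element names (entry, sense, gloss); this is not A-Z+T's actual lift.py API.

```python
# Sketch: thin object wrappers over LIFT XML nodes, assuming stock LIFT
# element names; not A-Z+T's actual lift.py API.
import xml.etree.ElementTree as ET

class Sense:
    def __init__(self, node):
        self.node = node                      # underlying <sense> element

    def gloss(self, lang):
        el = self.node.find(f"gloss[@lang='{lang}']/text")
        return el.text if el is not None else None

class Entry:
    def __init__(self, node):
        self.node = node                      # underlying <entry> element

    @property
    def guid(self):
        return self.node.get("guid")

    def senses(self):
        return [Sense(s) for s in self.node.findall("sense")]

class Lift:
    def __init__(self, filename):
        self.tree = ET.parse(filename)

    def entries(self):
        return [Entry(e) for e in self.tree.getroot().findall("entry")]
```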

How much of the formatting object is really necessary?

  • What could be replaced by methods? ALL
  • Methods could ask or set, as with other methods
  • Would methods be more or less efficient? MUCH MORE

What other modules should be able to be run independently?
