May
May week5
*research(ai/bio/nlp-papers/lectrues/projects)
** upload zenodo
*class->candiexam
MAYBE FUTURE WORKS
** Ideation(with conference?)-please keep the ideas...
*** qiita + norm? -> meta + emb (->llm)
*** unlearning?
** fair vllm?
** LLM app for gene (wait?)
** advAttack_verification and geneLMs (wait?)
* future...research
** iterative alignment algo
** paper focus: E-NLP(??description?)
** Aug focus(B)
** deep hierarchy
(future pro+nu)
** generative benchmark
(gf+rm+dc--->nucl)
(check nucl with blast--->check real nicl whether sim or not)
** using drug bank web or db(good for using)
## eng v/q/w first and.. ( reading/listening/speaking/writing(paper) )
## math? code? (lecture/assignment/project) Monday, May 26, 2025
- class
- research
o
Tuesday, May 27, 2025
- class
- research
o
Wednesday, May 28, 2025
- class
- research
o
Thursday, May 29, 2025
- class
- research
o
Friday, May 30, 2025
- class
-
research
o - meta analysis
** how close to predict(concentration), swap PAA and CL
** make ppt
Saturday, May 31, 2025
- class
- research
o
[now]
- candi
- subsidy
[others..later]
- set attention? ** OTU based prediction->not practical–>maybe add tree number for xgb and rf
** pedict diabetes?
** dif gen lm?
** OTU + meta + LLM
- 학회에서의 인사이트들?
이미 clip, llm 다 씀 for med ai..wow
contrastive learning, generation for aug 등 생각할만한 거 다 하는 느낌..
bio도 금방일듯
federated learning 좀 흥미로웠네(학습한 피처만 보내는거..중앙으로..병원등 개인정보땜에)
unlearning-> 잊게 만드는거.. 요즘 많이 다루는 분야인듯(이것도 개인정보 관련)
diffusion기반(랜덤기반) 디엔에이?
co-scientist? multi agent..llms with other apis(collaborator)..별건아닌거같아..
새로운 신기한거 많네 causal model들이나 hypergraph? heterophilly?
평소같으면 전혀 안 찾아볼 것들 보게되는 것도 아주 굿
그래프네트워크랑 언어모델이랑(트랜스포머)를 함께 사용하는 접근법도 아주 많네..!
트랜스포머에 모종의 확률의 추가하여 약간의 변화를 주는 방법도 많은듯
시퀀스랑 텍스트랑도 많이들 생각하는…
fidelity? 충실도?
fidelity는 모델이 원래 데이터, 원본 모델, 혹은 목표 함수와 얼마나 잘 일치하는지를 나타내는 개념이며, 다양한 머신러닝 응용에서 중요한 품질 평가 기준이 됩니다.
->여러 메트릭을 포함하는 개념인듯.. comsine sim, KL divergence, robust accuracy, 등등
-여기서부터 이어서… 이미지로부터 dna seq추출 신기(사실은 seq분류였음)
이미 rna모델은 엄청 많이 다루네.. 미생물로 확장해서 소타보다 강함을 보이거나
특이한 테스크로 프리트레인하거나 뭐 k mer에 스트라이드를 줘서 최적화하거나…
vector quantization으로 토큰화하는거 진짜 신기하네.. 이걸로 protein 생성모델 만드는거 오오 신기
E(3)-동변량 그래프 신경망(EGNN) 라는 단백질 3d구조 기반의(기울기 등 space정보 포함) 벡터롤 또 얻어서 PLM 벡터랑 합쳐서 뭐 예측 모델에 사용하기도 하네.. 이것도 특이….
rna LM + protein LM도 있는데 각각의 임베딩을 다시 입력으로 받아서 트랜스포머 써서 특정 테스크를 위한 파인튜닝하여 새로 임베딩 얻어서 사용하는… 그것도 인상적이었음
- 테스크 인사이트??
** Drug Target Discovery (유전자 발현 수준에서 특정 바이오마커를 찾는??)
** Disease Trajectory Reconstruction Using EHRs(유전체 데이터(GWAS, RNA-seq)를 포함할 수도..유전자 변이나 발현 패턴이 질병 진행과 연관될 경우)
** identifying pathogen(병원체)
** 좀 다르지만 생물학 문제(뭐 수능문제같은거)로 LLM평가해도 될 듯 … novelty랑 creativity랑 좀 더 depth한 개념들 갖고…
** 바로 위와 연관지어서 라우팅으로 생물과 화학을 나눈다던지, 또 연산부분은 나눈다던지 등..
** 또 좀 다르지만 t-SNE 대체재 또는 보완재로 UMAP, NCVis, It-SNE, 그리고 PaCMAP(with Differential Expression of Genes Between Clusters), LocalMAP
** interpretable neural networks..ProtoPNet? 이건 잘 모르겠군 도입하기가 ㅎㅎ
** 연관해서 causual 연구들도 관련이 있을듯… 분야가 확실히 많더라
May week4
*research(ai/bio/nlp-papers/lectrues/projects)
** upload zenodo
*class->candiexam
MAYBE FUTURE WORKS
** Ideation(with conference?)-please keep the ideas...
*** qiita + norm? -> meta + emb (->llm)
*** unlearning?
** fair vllm?
** LLM app for gene (wait?)
** advAttack_verification and geneLMs (wait?)
* future...research
** iterative alignment algo
** paper focus: E-NLP(??description?)
** Aug focus(B)
** deep hierarchy
(future pro+nu)
** generative benchmark
(gf+rm+dc--->nucl)
(check nucl with blast--->check real nicl whether sim or not)
** using drug bank web or db(good for using)
## eng v/q/w first and.. ( reading/listening/speaking/writing(paper) )
## math? code? (lecture/assignment/project) Monday, May 19, 2025
- class
- research
o
Tuesday, May 20, 2025
- class
- research
o
Wednesday, May 21, 2025
- class
- research
o
Thursday, May 22, 2025
- class
- research
o
mfe
블라스트
Friday, May 23, 2025
- class
- research
o
Saturday, May 24, 2025
- class
- research
o
test 10 times? and format genbio and add exps(TODO)
https://www.sciencebase.gov/catalog/item/5fe22dead34e30b9123f09b5
Sunday, May 25, 2025
- class
- research
o
[now]
- test 10 times? and format genbio and add exps(TODO) (2) https://www.sciencebase.gov/catalog/item/5fe22dead34e30b9123f09b5
- cam red (2)
** include zenodo / acknowledgement - subsidy
[others..later]
-
meta analysis
** Replicate these experiments 10 times ** predict percentage different ** how close to predict(concentration), swap PAA and CL -
candi
-
set attention? ** OTU based prediction->not practical–>maybe add tree number for xgb and rf
** pedict diabetes?
** dif gen lm?
** OTU + meta + LLM
May week3
*research(ai/bio/nlp-papers/lectrues/projects)
** upload zenodo
** Ideation(with conference?)-please keep the ideas...
*** qiita + norm? -> meta + emb (->llm)
*** unlearning?
** fair vllm?
** LLM app for gene (wait?)
** advAttack_verification and geneLMs (wait?)
*class->candiexam
* future...research
** iterative alignment algo
** paper focus: E-NLP(??description?)
** Aug focus(B)
** deep hierarchy
(future pro+nu)
** generative benchmark
(gf+rm+dc--->nucl)
(check nucl with blast--->check real nicl whether sim or not)
** using drug bank web or db(good for using)
## eng v/q/w first and.. ( reading/listening/speaking/writing(paper) )
## math? code? (lecture/assignment/project) Monday, May 12, 2025
- class
- research
o
Tuesday, May 13, 2025
- class
- research
o
Wednesday, May 14, 2025
- class
- research
o
Thursday, May 15, 2025
- class
- research
o
exp describe
ref
git..
Friday, May 16, 2025
- class
- research
o
Saturday, May 17, 2025
- class
- research
o
Sunday, May 18, 2025
- class
-
research
o - align with meta, OTU, and seqs
- meta + LLM prediction–> gemma,llama,qwen? –> not practical i think..
** with small sample->OK..
** (* predict soil-> * sample_type,scientific_name …., abundance, Quantitative Microbial Risk Assessment ) - three papers report
[now]
-
algorithm for adv
- predict percentage different
-
how close to predict(concentration), swap PAA and CL
-
OTU based prediction->not practical–>maybe add tree number for xgb and rf
- pedict diabetes?
-
dif gen lm?
- OTU + meta + LLM
May week2
*research(ai/bio/nlp-papers/lectrues/projects)
** upload zenodo
** Ideation(with conference?)-please keep the ideas...
*** qiita + norm? -> meta + emb (->llm)
*** unlearning?
** fair vllm?
** LLM app for gene (wait?)
** advAttack_verification and geneLMs (wait?)
*class->candiexam
* future...research
** iterative alignment algo
** paper focus: E-NLP(??description?)
** Aug focus(B)
** deep hierarchy
(future pro+nu)
** generative benchmark
(gf+rm+dc--->nucl)
(check nucl with blast--->check real nicl whether sim or not)
** using drug bank web or db(good for using)
## eng v/q/w first and.. ( reading/listening/speaking/writing(paper) )
## math? code? (lecture/assignment/project) Monday, May 05, 2025
- class
- research
o
Tuesday, May 06, 2025
- class
- research
o
Wednesday, May 07, 2025
- class
- research
o
Thursday, May 08, 2025
- class
- research
o
Friday, May 09, 2025
- class
- research
o
Saturday, May 10, 2025
- class
- research
o
Sunday, May 11, 2025
- class
-
research
o - align with meta, OTU, and seqs
- meta + LLM prediction–> gemma,llama,qwen? –> not practical i think..
** with small sample->OK..
—————————-
[now]
** (* predict soil-> * sample_type,scientific_name …., abundance, Quantitative Microbial Risk Assessment )
- algorithm for adv
-
three papers report
-
OTU based prediction->not practical–>maybe add tree number for xgb and rf
- pedict diabetes?
-
dif gen lm?
- OTU + meta + LLM
May week1
*research(ai/bio/nlp-papers/lectrues/projects)
** upload zenodo
** Ideation(with conference?)-please keep the ideas...
*** qiita + norm? -> meta + emb (->llm)
*** unlearning?
** fair vllm?
** LLM app for gene (wait?)
** advAttack_verification and geneLMs (wait?)
*class->candiexam
* future...research
** iterative alignment algo
** paper focus: E-NLP(??description?)
** Aug focus(B)
** deep hierarchy
(future pro+nu)
** generative benchmark
(gf+rm+dc--->nucl)
(check nucl with blast--->check real nicl whether sim or not)
** using drug bank web or db(good for using)
## eng v/q/w first and.. ( reading/listening/speaking/writing(paper) )
## math? code? (lecture/assignment/project) Thursday, May 01, 2025
- class
- research
o
Friday, May 02, 2025
- class
- research
o
Saturday, May 03, 2025
- class
- research
o
Sunday, May 04, 2025
- class
-
research
o - align with meta, OTU, and seqs
-
meta + LLM prediction–> gemma,llama,qwen? –> not practical i think..
** with small sample->OK..
—————————-
** (* predict soil-> * sample_type,scientific_name …., abundance, Quantitative Microbial Risk Assessment ) -
OTU based prediction->not practical–>maybe add tree number for xgb and rf
- pedict diabetes?
-
dif gen lm?
- OTU + meta + LLM