chq-saleen 发表于 2019-6-4 16:47:28

新手求助,TXT文档提取相关字段,并保存入表格

本麻瓜想整理一些Google Schlolar下载的文献资料,保存为TXT格式,现在想把一些字段提取出来保存入Excel中,但是在提取字段中碰到问题,请大家帮忙。
我想提取高亮部分的字段(黄色),后附要求(绿色)。


PMID- 31131053{PMID号}
OWN - NLM
STAT- In-Data-Review
LR- 20190527
IS- 1838-7640 (Electronic)
IS- 1838-7640 (Linking)
VI- 9
IP- 9
DP- 2019{年}
TI- Regulation of ezrin tension by S-nitrosylation mediates non-small cell lung
      cancer invasion and metastasis.
PG- 2555-2571
LID - 10.7150/thno.32479 【可能会有两个LID,选择后面有的】{DOI号}
AB- Cancer invasion and metastasis depend on accurate and rapid modulation of both
      chemical and mechanical activities. The S-nitrosylation (SNO) of membrane
      cytoskeletal cross-linker protein ezrin may regulate the malignant process in a
      tension-dependent manner. Methods: The level of nitrosylated ezrin in non-small
      cell lung cancer (NSCLC) tissues and A549 cell line were evaluated by
      biotin-switch assay. A few cysteine mutated plasmids of ezrin were used to
      identify active site for SNO. Newly designed ezrin or mutated-ezrin tension
      probes based on Forster resonance energy transfer (FRET) theory were applied to
      visually observe real-time tension changes. Cytoskeleton depolymerizing and motor
      molecular inhibiting experiments were performed to reveal the alternation of the
      mechanical property of ezrin after SNO. Transwell assays and xenograft mouse
      model were used to assess aggressiveness of A549 cells in different groups.
      Fluorescent staining was also applied to examine cellular location and
      structures. Results: High inducible nitric oxide synthase (iNOS) levels were
      observed to induce ezrin-SNO, and then promote malignant behaviors of NSCLC cells
      both in vitro and in vivo. Cys(117) was identified as the only active site for
      ezrin-SNO. Meanwhile, an increased level of ezrin tension was observed after
      iNOS-induced SNO. Enhanced ezrin tension was positively correlated with
      aggressiveness of NSCLC. Moreover, Microfilament (MF) forces instead of
      microtubule (MT) forces played dominant roles in modulating ezrin tension,
      especially after ezrin nitrosylation. Conclusion: This study revealed a
      SNO-associated mechanism underlying the mechanical tension of ezrin. Ezrin-SNO
      promotes NSCLC cells invasion and metastasis through facilitating mechanical
      transduction from the cytoskeleton to the membrane. These studies implicate the
      therapeutic potential by targeting ezrin in the inhibition NSCLC invasion and
      metastasis. {摘要}
FAU - Zhang, Xiaolong
AU- Zhang X
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
      Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
      210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
      University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Li, Guangming
AU- Li G
AD- Department of Anesthesiology, Huaian First People's Hospital, Nanjing Medical
      University, Huaian 223001, Jiangsu, PR China.
FAU - Guo, Yichen
AU- Guo Y
AD- Department of Surgery and Biomedical Engineering, University of Alabama at
      Birmingham (UAB), Birmingham, Alabama. 35294, USA.
FAU - Song, Ying
AU- Song Y
AD- Department of Respiratory Medicine, The First Affiliated Hospital of Nanjing
      Medical University, Nanjing 210029, PR China.
FAU - Chen, Linlin
AU- Chen L
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
      Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
      210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
      University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Ruan, Qinli
AU- Ruan Q
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
      Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
      210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
      University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Wang, Yifan
AU- Wang Y
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
      Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
      210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
      University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Sun, Lixia
AU- Sun L
AD- Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing 210023,
      Jiangsu, PR China.
FAU - Hu, Yunfeng
AU- Hu Y
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
      Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
      210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
      University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Zhou, Jingwen
AU- Zhou J
AD- The First Clinical Medical College of Nanjing University of Chinese Medicine,
      Nanjing, Jiangsu 210023, PR China.
FAU - Ren, Bin
AU- Ren B
AD- Department of Surgery and Biomedical Engineering, University of Alabama at
      Birmingham (UAB), Birmingham, Alabama. 35294, USA.
FAU - Guo, Jun【如果最后一个作者有2个单位,2个单位都要,但是可以放在同一个单元格中】{通讯作者全名(优先选择地址里面有邮箱的,否则就选择最后一个作者)}
AU- Guo J
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
      Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
      210023, Jiangsu, PR China. {通讯作者单位}
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
      University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
LA- eng
PT- Journal Article
DEP - 20190413{发表时间}
PL- Australia
TA- Theranostics
JT- Theranostics{期刊全名}
JID - 101552395
PMC - PMC6525990
OTO - NOTNLM
OT- FRET
OT- ezrin S-nitrosylation
OT- invasion
OT- metastasis
OT- non-small cell lung cancer【所有OT,可以放在同一个单元格中】{关键词}
COIS- Competing Interests: The authors have declared that no competing interest exists.
EDAT- 2019/05/28 06:00
MHDA- 2019/05/28 06:00
CRDT- 2019/05/28 06:00
PHST- 2018/12/21 00:00
PHST- 2019/03/18 00:00
PHST- 2019/05/28 06:00
PHST- 2019/05/28 06:00
PHST- 2019/05/28 06:00
AID - 10.7150/thno.32479 【可能会有两个AID,选择后面有的】{DOI号2}
AID - thnov09p2555
PST - epublish
SO- Theranostics. 2019 Apr 13;9(9):2555-2571. doi: 10.7150/thno.32479. eCollection
      2019. {引文信息}

wp231957 发表于 2019-6-4 17:21:30

那么多FAU 都选吗还是只选你框黄色的部分

wp231957 发表于 2019-6-4 18:36:22


s='''
PMID- 31131053{PMID号}
OWN - NLM
STAT- In-Data-Review
LR- 20190527
IS- 1838-7640 (Electronic)
IS- 1838-7640 (Linking)
VI- 9
IP- 9
DP- 2019{年}
TI- Regulation of ezrin tension by S-nitrosylation mediates non-small cell lung
       cancer invasion and metastasis.
PG- 2555-2571
LID - 10.7150/thno.32479 【可能会有两个LID,选择后面有的】{DOI号}
AB- Cancer invasion and metastasis depend on accurate and rapid modulation of both
       chemical and mechanical activities. The S-nitrosylation (SNO) of membrane
       cytoskeletal cross-linker protein ezrin may regulate the malignant process in a
       tension-dependent manner. Methods: The level of nitrosylated ezrin in non-small
       cell lung cancer (NSCLC) tissues and A549 cell line were evaluated by
       biotin-switch assay. A few cysteine mutated plasmids of ezrin were used to
       identify active site for SNO. Newly designed ezrin or mutated-ezrin tension
       probes based on Forster resonance energy transfer (FRET) theory were applied to
       visually observe real-time tension changes. Cytoskeleton depolymerizing and motor
       molecular inhibiting experiments were performed to reveal the alternation of the
       mechanical property of ezrin after SNO. Transwell assays and xenograft mouse
       model were used to assess aggressiveness of A549 cells in different groups.
       Fluorescent staining was also applied to examine cellular location and
       structures. Results: High inducible nitric oxide synthase (iNOS) levels were
       observed to induce ezrin-SNO, and then promote malignant behaviors of NSCLC cells
       both in vitro and in vivo. Cys(117) was identified as the only active site for
       ezrin-SNO. Meanwhile, an increased level of ezrin tension was observed after
       iNOS-induced SNO. Enhanced ezrin tension was positively correlated with
       aggressiveness of NSCLC. Moreover, Microfilament (MF) forces instead of
       microtubule (MT) forces played dominant roles in modulating ezrin tension,
       especially after ezrin nitrosylation. Conclusion: This study revealed a
       SNO-associated mechanism underlying the mechanical tension of ezrin. Ezrin-SNO
       promotes NSCLC cells invasion and metastasis through facilitating mechanical
       transduction from the cytoskeleton to the membrane. These studies implicate the
       therapeutic potential by targeting ezrin in the inhibition NSCLC invasion and
       metastasis. {摘要}
FAU - Zhang, Xiaolong
AU- Zhang X
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
       Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
       210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
       University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Li, Guangming
AU- Li G
AD- Department of Anesthesiology, Huaian First People's Hospital, Nanjing Medical
       University, Huaian 223001, Jiangsu, PR China.
FAU - Guo, Yichen
AU- Guo Y
AD- Department of Surgery and Biomedical Engineering, University of Alabama at
       Birmingham (UAB), Birmingham, Alabama. 35294, USA.
FAU - Song, Ying
AU- Song Y
AD- Department of Respiratory Medicine, The First Affiliated Hospital of Nanjing
       Medical University, Nanjing 210029, PR China.
FAU - Chen, Linlin
AU- Chen L
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
       Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
       210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
       University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Ruan, Qinli
AU- Ruan Q
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
       Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
       210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
       University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Wang, Yifan
AU- Wang Y
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
       Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
       210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
       University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Sun, Lixia
AU- Sun L
AD- Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing 210023,
       Jiangsu, PR China.
FAU - Hu, Yunfeng
AU- Hu Y
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
       Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
       210023, Jiangsu, PR China.
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
       University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
FAU - Zhou, Jingwen
AU- Zhou J
AD- The First Clinical Medical College of Nanjing University of Chinese Medicine,
       Nanjing, Jiangsu 210023, PR China.
FAU - Ren, Bin
AU- Ren B
AD- Department of Surgery and Biomedical Engineering, University of Alabama at
       Birmingham (UAB), Birmingham, Alabama. 35294, USA.
FAU - Guo, Jun【如果最后一个作者有2个单位,2个单位都要,但是可以放在同一个单元格中】{通讯作者全名(优先选择地址里面有邮箱的,否则就选择最后一个作者)}
AU- Guo J
AD- State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
       Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
       210023, Jiangsu, PR China. {通讯作者单位}
AD- Key Laboratory of Drug Target and Drug for Degenerative Disease, Nanjing
       University of Chinese Medicine, Nanjing 210023, Jiangsu, PR China.
LA- eng
PT- Journal Article
DEP - 20190413{发表时间}
PL- Australia
TA- Theranostics
JT- Theranostics{期刊全名}
JID - 101552395
PMC - PMC6525990
OTO - NOTNLM
OT- FRET
OT- ezrin S-nitrosylation
OT- invasion
OT- metastasis
OT- non-small cell lung cancer【所有OT,可以放在同一个单元格中】{关键词}
COIS- Competing Interests: The authors have declared that no competing interest exists.
EDAT- 2019/05/28 06:00
MHDA- 2019/05/28 06:00
CRDT- 2019/05/28 06:00
PHST- 2018/12/21 00:00
PHST- 2019/03/18 00:00
PHST- 2019/05/28 06:00
PHST- 2019/05/28 06:00
PHST- 2019/05/28 06:00
AID - 10.7150/thno.32479 【可能会有两个AID,选择后面有的】{DOI号2}
AID - thnov09p2555
PST - epublish
SO- Theranostics. 2019 Apr 13;9(9):2555-2571. doi: 10.7150/thno.32479. eCollection
       2019. {引文信息}
'''

import re

pmid=re.search("PMID- (\d+){",s)
print(pmid.group(1))
dp=re.search("DP\s+- (\d+){",s)
print(dp.group(1))
ti= re.search("TI\s+- ([\sa-zA-Z-\.]+)PG",s,re.M)
print(ti.group(1))
lid=re.search("LID\s+- ([\sa-zA-Z-\.\\0-9]+)\[",s)
print(lid.group(1))
ab=re.search("AB\s+- ([\sa-zA-Z-\.\\0-9\(\),:]+){",s,re.M)
print(ab.group(1))
fau=re.findall("FAU\s+- ([\sa-zA-Z,]+)$",s,re.M)
for x in fau:print(x)
ad= re.search("AD\s+- ([\sa-zA-Z0-9-\.,]+){",s,re.M)
print(ad.group(1))
dep=re.search("DEP - (\d+){",s)
print(dep.group(1))
jt=re.search("JT\s+- (+){",s)
print(jt.group(1))
ot=re.findall("OT\s+- (+)$",s,re.M)
for x in ot:print(x)
ot2=re.search("OT\s+- (+\-+\s++\s++\s++)",s)
print(ot2.group(1))
aid=re.findall("AID\s+- ([\sa-z0-9\.\/]+)",s)
for x in aid:print(x)
so=re.search("SO\s+- ([\sA-Za-z0-9\.\/:;\(\)-]+)",s,re.M)
print(so.group(1))





wp231957 发表于 2019-6-4 18:36:56

e:\>python ex3.py
31131053
2019
Regulation of ezrin tension by S-nitrosylation mediates non-small cell lung
       cancer invasion and metastasis.

10.7150/thno.32479
Cancer invasion and metastasis depend on accurate and rapid modulation of both
       chemical and mechanical activities. The S-nitrosylation (SNO) of membrane
       cytoskeletal cross-linker protein ezrin may regulate the malignant process in a
       tension-dependent manner. Methods: The level of nitrosylated ezrin in non-small
       cell lung cancer (NSCLC) tissues and A549 cell line were evaluated by
       biotin-switch assay. A few cysteine mutated plasmids of ezrin were used to
       identify active site for SNO. Newly designed ezrin or mutated-ezrin tension
       probes based on Forster resonance energy transfer (FRET) theory were applied to
       visually observe real-time tension changes. Cytoskeleton depolymerizing and motor
       molecular inhibiting experiments were performed to reveal the alternation of the
       mechanical property of ezrin after SNO. Transwell assays and xenograft mouse
       model were used to assess aggressiveness of A549 cells in different groups.
       Fluorescent staining was also applied to examine cellular location and
       structures. Results: High inducible nitric oxide synthase (iNOS) levels were
       observed to induce ezrin-SNO, and then promote malignant behaviors of NSCLC cells
       both in vitro and in vivo. Cys(117) was identified as the only active site for
       ezrin-SNO. Meanwhile, an increased level of ezrin tension was observed after
       iNOS-induced SNO. Enhanced ezrin tension was positively correlated with
       aggressiveness of NSCLC. Moreover, Microfilament (MF) forces instead of
       microtubule (MT) forces played dominant roles in modulating ezrin tension,
       especially after ezrin nitrosylation. Conclusion: This study revealed a
       SNO-associated mechanism underlying the mechanical tension of ezrin. Ezrin-SNO
       promotes NSCLC cells invasion and metastasis through facilitating mechanical
       transduction from the cytoskeleton to the membrane. These studies implicate the
       therapeutic potential by targeting ezrin in the inhibition NSCLC invasion and
       metastasis.
Zhang, Xiaolong
Li, Guangming
Guo, Yichen
Song, Ying
Chen, Linlin
Ruan, Qinli
Wang, Yifan
Sun, Lixia
Hu, Yunfeng
Zhou, Jingwen
Ren, Bin
State Key Laboratory Cultivation Base for TCM Quality and Efficacy, School of
       Medicine and Life Science, Nanjing University of Chinese Medicine, Nanjing
       210023, Jiangsu, PR China.
20190413
Theranostics
FRET
ezrin S-nitrosylation
invasion
metastasis
non-small cell lung cancer
10.7150/thno.32479
thnov09p2555
Theranostics. 2019 Apr 13;9(9):2555-2571. doi: 10.7150/thno.32479. eCollection
       2019.

chq-saleen 发表于 2019-6-5 09:43:02

wp231957 发表于 2019-6-4 18:36
e:\>python ex3.py
31131053
2019

非常感谢{:5_92:}。关于前面你提到的作者,是这样的要求:优先选择地址里面有邮箱的,否则就选择最后一个作者

chq-saleen 发表于 2019-6-13 11:20:29

wp231957 发表于 2019-6-4 18:36
e:\>python ex3.py
31131053
2019


非常感谢。关于前面你提到的作者和地址,是这样的要求:优先选择地址里面有邮箱的地址和作者,否则就选择最后一个地址和作者
页: [1]
查看完整版本: 新手求助,TXT文档提取相关字段,并保存入表格