Atlas size, as in Feb. 2018
<
Data extracted from | http://atlasgeneticsoncology.org/stat_atlas.html | |||
http://atlasgeneticsoncology.org/Status/Status.html | ||||
http://documents.irevues.inist.fr/handle/2042/15655 | ||||
Summary | Number of Authors | 3 440 | from 46 countries | |
Number of Internet Pages | 47 067 | |||
Number of Printed Pages (very approx.) | 194 924 | |||
Number of Images | 33 258 | |||
Journal pdf: Number of Pages (for PubMed, Scopus) | 10 824 | |||
Size in Kb (journal excluded) | 6 145 849 | Note: still very light! |
NUMBER of INTERNET PAGES
Number | |
Genes (html) | 32 858 |
Leukemias (html) | 1 900 |
Tumors (html) | 10 828 |
Kprones (html) | 180 |
Reports (html) | 99 |
Deep (html) | 123 |
Educ (html) | 136 |
Others | |
Societies (html) | 24 |
Events (html) | 8 |
Categories (html) | 70 |
ICD-O (html) | 97 |
Indexes | |
Gene Index (html) | 47 |
Chromosome Index (html) | 99 |
Chromosomal bands Index (html) | 445 |
Atlas status (html) | 84 |
Management and Archives | |
Collaborative files (html) | 66 |
TOTAL Number of Internet Pages | 47 064 |
NUMBER of PRINTED PAGES (approximately)
Nb of files | Nb of printed pages | count1 | comments | ||
Genes0 (txt) | 1 506 | 19 578 | 13 | Annoted genes: 10 pages each at least + external links 3 pages = 13 pages each | |
Genes (txt) | 31 348 | 125 392 | 4 | Non-annoted genes: 4 pages each | |
1 460 | 730 annoted Leukemias (2 pages each, at least) | ||||
Leukemias (txt) | 1 626 | 2 360 | 900 | and 900 not annoted Leukemias (1 page each) | |
1 290 | 215 annoted Tumors (6 pages each, at least) | ||||
Tumors (txt) | 9 042 | 10 120 | 8 830 | and 8830 not annoted Tumors (1 page each) | |
Kprones (txt) | 115 | 805 | 7 | pages each? | |
Reports (txt) | 92 | 276 | 3 | pages each | |
Deep (html) | 123 | 1 353 | 11 | pages each | |
Educ (html) | 136 | 680 | 5 | pages each ???? | |
Others | |||||
Societies (html) | 24 | 0 | small number | ||
Events (html) | 8 | 0 | small number | ||
Categories (html) | 70 | 400 | 10 | tested on DNA replication (some more frequent, some less) | |
ICD-O (html) | 97 | 176 | 8 | tested on Head and Neck, and 11 system, + "All" | |
Indexes | |||||
Gene Index (html) | 47 | 520 | 20 | tested on "F", and there are 26 letters, some more frequent than "F", some less | |
Chromosome Index (html) | 99 | 600 | 25 | tested on chromosome 14 , and 24 chrom. | |
Chromosomal bands Index (html) | 445 | 21 840 | 70 | tested on 14q21, and 312 bands in the karyo. | |
Atlas status (html) | 84 | high number, e.g. http://atlasgeneticsoncology.org/Status/Status_Leukemia.html | |||
Management and Archives | |||||
Collaborative files (html) | 66 | not counted | |||
not counted: other pages | not counted: other pages (e.g. Recent.html, Tumors/Tumorliste.html ...) | ||||
Journal pdf | 10 824 | Journal: http://documents.irevues.inist.fr/handle/2042/15655 | |||
TOTAL Nb of Printed Pages | 194 924 | Note: very approximate; likely to be much more |
NUMBER of IMAGES
Number | |
map hg38 (png) | 27 543 |
Genes (img) | 3 112 |
Leukemias (img) | 961 |
Tumors (img) | 328 |
Kprones (img) | 124 |
Reports (img) | 187 |
Deep (img) | 473 |
Educ (img) | 482 |
and Chrom. Pages, and others | 48 |
TOTAL Nb of Images | 33 258 |
JOURNAL pdf
data from http://documents.irevues.inist.fr/handle/2042/15655
Year (Nb of issues) | Nb of Pages | |||
1997 (2 N°) | 105 | |||
1998 (4 N°) | 160 | |||
1999 (4 N°) | 224 | |||
2000 (4 N°) | 247 | |||
2001 (4 N°) | 307 | |||
2002 (4 N°) | 349 | |||
2003 (4 N°) | 309 | |||
2004 (4 N°) | 362 | |||
2005 (4 N°) | 369 | |||
2006 (4 N°) | 311 | |||
2007 (4 N°) | 364 | |||
2008 (6 N° +suppl.) | 569 | 487 | 82 | |
2009 (12 N°) | 991 | |||
2010 (12 N°) | 1 192 | |||
2011 (12 N°) | 1 089 | |||
2012 (12 N°) | 947 | |||
2013 (12 N°) | 869 | |||
2014 (12 N°) | 964 | |||
2015 (12 N°) | 717 | |||
2016 (12 N°, in progress) | 379 | (6 of 12 | N° done) | |
2017 (12 N°, in progress) | 0 | |||
2018 (in progress) | 0 | |||
TOTAL | 10 824 |
NUMBER of AUTHORS by COUNTRY
Country | Number of Authors |
USA | 972 |
France | 401 |
Italy | 291 |
Japan | 195 |
Spain | 178 |
Germany | 172 |
UK | 153 |
Canada | 145 |
China | 138 |
The Netherlands | 98 |
Brazil | 90 |
Australia | 69 |
India | 67 |
Belgium | 66 |
Sweden | 61 |
Greece | 58 |
Turkey | 55 |
Switzerland | 29 |
Mexico | 28 |
Taiwan | 26 |
Argentine | 24 |
Portugal | 23 |
Poland | 23 |
Russia | 22 |
Denmark | 21 |
Korea | 18 |
Israel | 17 |
Austria | 17 |
Finland | 16 |
Tunisia | 14 |
Norway | 14 |
Ireland | 12 |
Iran | 10 |
South Africa | 8 |
Singapore | 8 |
Bulgaria | 7 |
Slovenia | 6 |
Serbia | 6 |
Kuwait | 6 |
Czech Republic | 5 |
New Zealand | 3 |
Croatia | 3 |
Egypte | 2 |
Chili | 2 |
Slovakia | 1 |
Romania | 1 |
Soon, a new website, see: https://atlasgeneticsoncology.usal.es/v2.2/
SECTION EDITORS of the ATLAS
I- EDITORIAL BOARD of the ATLAS / SECTION EDITORS
II- ROLE of a SECTION EDITOR in the ATLAS
III- MISSING SECTION EDITORS
I- EDITORIAL BOARD
see http://atlasgeneticsoncology.org/BackpageAbout.html#EDITORIAL
II- ROLE of a SECTION EDITOR in the GENES topics
The Section Editor has to make a list of genes related to his/her Section (e.g. "Cell cycle" for the page http://atlasgeneticsoncology.org/Categories/Cell_cycle.html).
The Section Editor has to find authors for the most important genes in his Section, or write papers himself with his team.
Old papers alos have to be updated.
At least 5 to 10 papers must be produced each year.
REVIEW PROCESS: As the Section Editor chooses the invited authors on the grounds of their:
I-2.1. knowledge in the matter;
I-2.2. ability to write i) fully documented papers; ii) in a very didactic manner,
there is, most of the time, no need for external referring (the 2 referees being the Section Editor and one of the Editor in Chiefs).
Inasmuch as there are guidelines/submissions forms (templates) to be strictly used (e.g. http://atlasgeneticsoncology.org/Forms/Gene_Form_for_submission.doc submission on-line soon ), so that authors cannot miss an important item, either in the clinic part or in the molecular one.
III- MISSING SECTION EDITORS in the GENES and CELL BIOLOGY topics
The Atlas is in need of Section Editors; candidates are welcome for the following sections:
- Cytoskeleton/Scaffold
- DNA repair
- DNA replication-recombination
- Endocytosis/Exocytosis/Vesicle traffic
- Micro RNA/Other Non-coding DNA
- Epithelial-mesenchymal transition
- Immunity
- Inflammation
- Nuclear membrane trafficking
- Oxidative stress
- Proteasome/Ubiquitination
- Spliceosome/Transcription machinery/Translation
- Stem Cells
- Telomeres
Please, contact us: This email address is being protected from spambots. You need JavaScript enabled to view it. and/or This email address is being protected from spambots. You need JavaScript enabled to view it.
Cancer concerns one out of two men and one out of three women.
Cancer is a genetic disorder.
Pronostic and treatment depend on the genes involved.
BUT 2 000 to 9 000 genes may be involved in cancer!
- The Atlas of genetics in cancer is an encyclopedic database made of more than 45,000 pages written by more than 3,500 experts. It is also an international journal.
- The Atlas is an information resource for doctors, researchers, teachers and students, accessed by over 4 600 people for 5 500 visits each day, 1.1 million unique users a year.
- The Atlas is in free access on the Internet at: AtlasGeneticsOncology.org.
Project title |
'Atlas of Genetics and Cytogenetics in Oncology and Haematology' |
Summary |
The Atlas is a peer reviewed internet journal / encyclopaedia / database devoted to: - genes implicated in cancer, - cytogenetic or clinical entities in cancer, and - hereditary diseases which are cancer-prone conditions. Size of the Atlas in 2015: 45,500 pages; 3,500 authors; 5,500 visits every open day; 1.1 million unique users a year (USA 27%).
|
Key words |
- Pooling of knowledge concerning the biology of normal and cancerous cells, - Referential and investigation tool for research, - Multilingual pedagogical support, - Translational health research - Transfer of scientific innovation towards research itself, and, downstream, towards patient care.
|
Context |
25,000 new publications concerning cancer genetics in man are added each year in PubMed. No one, anymore, has the whole required knowledge, necessary to guide the treatment procedure in case of a rare disease. Huge database are therefore needed, to collect and summarize data on these rare diseases, and produce meta-analyses.
|
Objectifves of the project |
- Medical treatment assistance in rare forms of cancer, - Efficiency savings in the fight against cancer, - Decrease in fundamental and applied research as well as medical costs. - Personalised medicine for cancer (one of the axes of the cancer plan 2014-2018).
|
What is the Atlas?
Cancer Genetics, a fast developping field, merits more than 200,000 pages
Cancer, the second cause of death in the developed countries, is the first cause of untimely death. Prognosis of a leukaemia, for instance, depends on the genes involved... However, about 4,500 genes are possibly implicated in cancer! Therefore, there is a need for huge databases collecting and summarising the fast growing knowledge accumulated in cancer genetics (see, for more details: Why the Atlas?).
The Atlas is a peer-reviewed journal / encyclopaedia / internet database aimed at genes involved in cancer, cytogenetics and leukemias, solid tumours, and cancer-prone diseases. It also comprises educational items in genetics for students.
Structure of the Atlas
The Atlas is mainly made of structured review articles or "cards" (see above), but it also contains traditional overviews, a huge portal towards websites and databases devoted to cancer and/or genetics, case reports in haematology, and, as we have said, teaching items in various languages.
Cards are structured: e.g. cards on genes are made of the following paragraphs:
1- identity
2- DNA/RNA
3- protein
4- diseases where the gene is involved
5- external links
6- bibliography.
Deep Insights: these review articles are more traditional papers; they often deal with various themes at the outskirts of our core topic.
Portal: it comprises about 275 links:
1- general links (bibliography, general resources)
2- databases in genetics
3- resources in cancer genetics and cytogenetics
4- mapping resources
5- chromosomes maps (mouse, man)
6- functional genomic, transcriptome
7- journals.
A collective effort
The Atlas is a collective effort from researchers and clinicians to give the state of the art in cancer genetics to the medical and scientific community, to provide a cognitive tool for fundamental and clinical research. Readers of the Atlas are consultants at the hospital, researchers, university teachers, but it also reaches GP's, students in medicine, and students in sciences.
The genome project
The Atlas is part of the genome project and it participates in research on cancer epidemiology. It is at the crossroads of research, university and post-university teaching (e-learning, virtual medical university) and telemedicine, using new information and communication technologies. It contributes to meta-medicine, this mediation between the overflowing information provided by the scientific community and the individual practitioner. It helps the clinician in therapy decision. Its reputation is international.
The Atlas is free, in particular for third world countries.
- Founded in 1993,
the association ARMGHM - Atlas Génétique des Cancers - is a not-for-profit organization
under French law (law 1901), whose purpose is to host the Atlas of Cancer Genetics, to facilitate its editorial process and to ensure the propagation of its contents.
- It is necessary to sustain the Atlas. Funding is essential for the maintenance and continuous updating and development of the site.
- Over 90% of our operating expenses between 2008 and 2015 has gone to salary support for MSc or Phd Editorial Officers in charge of finding matters of study and their appropriate authors, and processing and editing papers (see Activity Reports).
The Atlas needs money,
to continue to assist medical and scientific community
PLEASE, MAKE A DONATION
... and If you use the Atlas regularly,
Become a member of the Association, for 30 euros or 40 dollars only
Use Paypal to pay and become a member. Specify in the "purpose" box that the payment is for membership.
Membership from cytogenetics labs are welcome,
for 100 euros or dollars (or much more, indeed)
We offer you a free service,
Please, help in return
http://atlasgeneticsoncology.org
Philippe Dessen (Database Director) This email address is being protected from spambots. You need JavaScript enabled to view it.
Jean Loup Huret (Editor) This email address is being protected from spambots. You need JavaScript enabled to view it.
May 2018
I- Main page:
II-1.
Foreword 1: There are various types of items developed in the Atlas:
1- Genes (http://atlasgeneticsoncology.org/Genes/XXX )
1-1. Annotated genes (papers/cards written by authors) URLs: http://atlasgeneticsoncology.org/Genes/[name-of-gene]ID[number]ch[location].html (1,493 papers/cards, e.g. http://atlasgeneticsoncology.org/Genes/PGRID41700ch11q22.html); and
1-2. Automated cards on genes (more or less like GeneCards); URLs: http://atlasgeneticsoncology.org/Genes/GC_[name-of-gene].html (28,377 cards);
2- Leukemias 681 annotated papers/cards : http://atlasgeneticsoncology.org/Anomalies/XXX , and 540 "Other Leukemias" (automated cards) : http://atlasgeneticsoncology.org/Anomalies/TL_XXX .
3- Solid tumors 217 annotated papers/card : http://atlasgeneticsoncology.org/Tumors/XXX , and 2,968 "Other Tumors" (automated cards) : http://atlasgeneticsoncology.org/Tumors/TT_XXX .
4- Cancer-prone diseases 114 annotated papers/cards : http://atlasgeneticsoncology.org/Kprones/XXX .
5- Case reports in hematology (http://atlasgeneticsoncology.org/Reports/XXX ) (88 papers/cards)
All these cards are structured from templates (e.g. Submission form for GENES: http://atlasgeneticsoncology.org/Forms/Gene_Form_for_submission.doc) with the addition of a HEADER with tags or tracking devices allowing for indexing of the form in different parts of the data base (e.g. TRI_PAR_CHROMOSOME -> to which chromosome page (red arrow)? CATEGORY-> to which Cell Biology page (red arrow)?); see also: http://atlasgeneticsoncology.org/Collab/catalog and http://chromosomesincancer.org/en/template-for-cards-papers.html ;
- and EXTERNAL LINKS (bottom of each paper/card).
There are also
- Deep Insights (traditional papers) http://atlasgeneticsoncology.org/Deep/XXX (113 Deep)
- Chromosome pages (http://atlasgeneticsoncology.org/Indexbychrom/idxa_[chromosome-number].html e.g. http://atlasgeneticsoncology.org/Indexbychrom/idxa_11.html ) and
- Chromosome band pages (http://atlasgeneticsoncology.org/Bands/[band].html e.g. http://atlasgeneticsoncology.org/Bands/19p13.html ),
- Cell biology pages (http://atlasgeneticsoncology.org/Categories/[category-name] e.g. http://atlasgeneticsoncology.org/Categories/Cell_cycle.html
- ICD-O pages (International Classification of Diseases - Oncology WHO/OMS) e.g. http://atlasgeneticsoncology.org/Tumors/Solid_Nosology.html and http://atlasgeneticsoncology.org/ICD/icd_2016_topo.html
- Atlas status (thesaurus of the Atlas: http://atlasgeneticsoncology.org/Status/Status.html and sub-pages)
- and various other pages (Backpage: http://atlasgeneticsoncology.org/BackpageAbout.html Recent papers http://atlasgeneticsoncology.org/Recent.html , Educational items http://atlasgeneticsoncology.org/GeneticEng.html , Genes partners, International cancer programs etc. (see others on the Main page)
A. Architecture of data:
A1. Text files for cards
Genes0
Anomalies (Leukemias)
Tumors
Kprones (Cancer-prone hereditary diseases)
Reports (Case Reports)
Deep (Deep Insight)
Educ (Educational Items)
The 2 last are defined by a couple of files (.meta + .htm)
A2. Chromosomal location
at the chromosome level
at the chomosomal band level
A3. Functional categories (Cell Biology)
A4 Count/Census of Atlas Items
Statistics on atlas files, see: http://atlasgeneticsoncology.org/Status/Status.html , and http://atlasgeneticsoncology.org/stat_atlas.html
A5. Catalogs and indexes
see in http://atlasgeneticsoncology.org/Collab/
catalog_full.txt (tabulated file with major informations from HEADER and IDENTITY blocs)
ObjDB0.txt ObjDB2.txt ObjDB4.txt ObjDB6.txt ObjDB.txt
ObjDB1.txt ObjDB3.txt ObjDB5.txt ObjDB7.txt: different files for indexing (might be simplified in a new structure)
A6. External resources
B. Modules for management and development
Module 1: Description of the templates of cards
see: http://atlasgeneticsoncology.org/Collab/Formes.xlsx ,
which correspond to the various submission forms for the authors:
http://atlasgeneticsoncology.org/Forms/Gene_Form_for_submission.doc
http://atlasgeneticsoncology.org/Forms/Leukemia_Form_for_submission.doc
http://atlasgeneticsoncology.org/Forms/Solid_Tumor_Form_for_submission.doc
http://atlasgeneticsoncology.org/Forms/Cancer-Prone_Disease_Form_for_submission.doc
Each card has a unique Atlas ID (present in the filename)
Genes: 1 --> 999 + 40000 --> 80000
Anomalies: 1001 --> 4999
Tumors: 5000 --> 9999
Kprones: 10000 --> 19999
Deep: 20000 --> 29999
Educ: 30000 --> 3999
Organisation of directories
All data is organized in two main directories
1. "cytatlas" (for managment)
2. " chromcancer" (with internet access)
1. cytatlas
./Genes0 (for expert txt)
./Genes (after txt processing)
./Anomalies
./Tumors
./Kprones
./Reports
./Deep
./Educ
Each directory has some other subdirectories for Images, xxLinks …
./Scripts (all bash and perl + references data)
2. chromcancer
./Genes
./Anomalies
./Tumors
./Kprones
./Reports
./Deep
./Educ
(each of the above with subdirectories for Images ..)
./Categories
./Indexbychrom (Chromosomes pages)
./Indexbyalpha
./Bands
./Status
./ICD
./ISCN
The process of scripts is in general made in the ./cytatlas/Scripts
The directory can be installed in a way defined by a general shell script
Cytatlas.sh which defines all the logical variables.
#!/usr/bin/bash
## script cytatlas_cygwin disque D
umask 002
CYT_DIR='/cygdrive/d/ATLAS/cytatlas'
CYTW_DIR='/cygdrive/d/ATLAS/chromcancer'
CYTHTML_DIR='http://genome.igr.fr/chromcancer'
SCRIPT_DIR='$CYT_DIR/Scripts'
PATH=$PATH:$CYT_DIR/Scripts
export CYT_DIR CYTW_DIR CYTHTML_DIR SCRIPT_DIR PATH
GENE_DIR=$CYT_DIR/Genes
ANOM_DIR=$CYT_DIR/Anomalies
TUMOR_DIR=$CYT_DIR/Tumors
KPRON_DIR=$CYT_DIR/Kprones
REPORT_DIR=$CYT_DIR/Reports
STUDY_DIR=$CYT_DIR/StudyGroup
DEEP_DIR=$CYT_DIR/Deep
EDUC_DIR=$HOME/DATA_DIR/Educ
WGENE_DIR=$CYTW_DIR/Genes
WANOM_DIR=$CYTW_DIR/Anomalies
WTUMOR_DIR=$CYTW_DIR/Tumors
WKPRON_DIR=$CYTW_DIR/Kprones
WREPORT_DIR=$CYTW_DIR/Reports
WDEEP_DIR=$CYTW_DIR/Deep
WEDUC_DIR=$CYTW_DIR/Educ
WCOLLAB_DIR=$CYTW_DIR/Collab
export GENE_DIR ANOM_DIR TUMOR_DIR KPRON_DIR
export REPORT_DIR STUDY_DIR DEEP_DIR EDUC_DIR
export WGENE_DIR WANOM_DIR WTUMOR_DIR WKPRON_DIR
export WREPORT_DIR WDEEP_DIR WEDUC_DIR WCOLLAB_DIR
alias cyt='cd $CYT_DIR'
alias cytw='cd $CYTW_DIR'
alias cyti='cd $CYT_DIR/Scripts'
echo 'cyt : $CYT_DIR'
echo 'cyti : $CYT_DIR/Scripts'
echo 'cytw : $CYTW_DIR'
echo 'www : $CYTHTML_DIR'
Module 2: Editorial management of cards
Foreword 2: Editorial process:
This is an important part, as the Editorial database processing must take it into account. See "Editorial workflow in the Atlas": http://chromosomesincancer.org/en/editorial-workflow.html .
In particular, critically important, Tables are used 1- to identify all/each relevant item (Table 1 herein below); 2- to dialogue with authors (Table 2).
These tables are the today ones used by the editor.
Examples:
NAME |
STATUS |
AUTHORS |
ID Atlas |
ICD-O3_MORPH |
ICD-O3_TOPO |
05;00§ tri 5/NHL or chronic Lympho |
FOR SALE |
C421,C424 |
|||
05;00§ MDS with isolated del (5q) |
DONE |
XX |
1134 |
9986/3 |
C421,C424 |
05;00§ del(5)(q32q33) TNIP1/PDGFRB |
Reserved |
XXXX |
1773 |
C421,C424 |
|
… about 1,000 items/lines |
|||||
99;99§ Extraosseous plasmacytoma |
Reserved |
XXXXXX |
1718 |
734/3 |
C421,C424 |
99;99§ Florid follicular hyperplasia PTLD |
Reserved |
XXX |
1788 |
C421,C424 |
AUTHOR |
|
"Translocation" |
DEADLINE |
COMMENTS |
XXX |
XXX@xx |
Florid follicular hyperplasia PTLD |
DONE |
3rd paper (leuk.) + 1 paper (gene) |
XXXX |
XXXX@xxxx |
del(5)(q32q33) TNIP1/PDGFRB |
????? |
Reminder 2017/06/26; 2017/03/21"Yes, will have this to you shortly"; Reminder 2016/11/19; Reminder 2016/06/17; 2016/01/17 no deadline ("soon"); 2015/10/14: OK |
XXXXX |
XXXXX@xxxx |
del(X)(p22p22) (P2RY8/CRLF2) |
16/03/2017 |
Spontaneous proposal |
Note: Tables used to identify all/each relevant item must be related (bijective type relation) with cards/papers; e.g. 05;00§ MDS with isolated del (5q) / ID 1134 <--> http://atlasgeneticsoncology.org/Anomalies/del5qSoleID1134.html
These tables may not exist per se. They would be integrated in the database of the Atlas (as, so far, some files as catalog, authors lists … on the INIST server. and automatically generated. But the right way will be to use ONLY screens developed ad hoc for editorial management.
Finally, we also have to format the cards/papers into word for the "scientific journal" version (see http://documents.irevues.inist.fr/handle/2042/15655 (e.g. http://documents.irevues.inist.fr/bitstream/handle/2042/62324/10-2014-HSPD1ID40888ch2q33.pdf , equivalent of http://atlasgeneticsoncology.org//Genes/HSPD1ID40888ch2q33.html ) of the Atlas.
A module is being finalized concerning the production of the scientific journal: It is a Web application developed with PHPWord/MySQL/Symfony Framework.
2.1 Actual process of management:
• Reception of a doc file (structured in the ad hoc template.
• Edition in a text file with good fields (validate the presence of a field in the beginning of lines (with eventually multiplicity of the same field)
• Addition of internal links (an expert task)
• Complementary procedure (by scripts)
- Correction of special characters (non compatibility between office Word and hypertext)
- Validation of bibliography with dowloading the short description of PubMed with PMID.
- Reordering biblio with alphabetic order
- Tests of good fields for each line (in good blocs)
- Test of links ( existent Atlas ID).
- Test of logical structure of each bloc (BEGIN.. / END)
- Transformation in hypertext documents (see further)
2.2 To be developed for a new management:
- On-line templates must be available to the authors who will write directly their paper (e.g. http://atlasgeneticsoncology.org/Forms/Gene_Form_for_submission.doc , corresponding to http://chromosomesincancer.org/en/atlas-templates-for-cards.html ). These templates must be capable of evolution when the Editor(s) in Chief wish to add or delete or modify a tag or even a "BEGIN_END". A password would allow the author to interrupt his writing, save, and come back latter until a last validation.
- A macro-instruction assisting the recognition of an internal hyperlink that the editorial staff member must add before publishing (e.g. ABL: a list of "NAMES" and aliases must help to recognize and propose the hyperlink to ABL1; the editorial member says OK, and the hyperlink to http://atlasgeneticsoncology.org/Genes/ABLID1.html comes automatically): --> need of a thesaurus (e.g. http://atlasgeneticsoncology.org/Tumors/Solid_Nosology.html
- PMID numbers solely permits to give the full reference
- Special characters are automatically transcribed (e.g. https://text-symbols.com/html/entities-and-descriptions , http://www.symbole-clavier.com , http://alexandre.alapetite.fr/doc-alex/alx_special.html ).
Module 3: Indexations
All the files of Atlas are considered as "objects" and are defined by an Atlas ID.
The standard of the filename of the objects is the following:
- Genes: 2 forms:
<String>ID<number><location>.txt Ex: ADCYAP1ID43656ch18p11.txt
<String>ID<number>.txt Ex: AF6ID6.txt
A generic form is built fr an easy links. The filename id defined as:
GC_<symbol>.txt e.g.: GC_ZFP90.txt
- Anomalies (Leukemias)
- Tumors
- Kprones
The filename is as : <string>ID<number>.txt ( ex :t3q21q26TreatRelLeukID1236.txt)
Creation (or Updating) of index or cards
As mentioned in the indexation.sh script (the main indexation performed after some modifications)
Indexation of Cards 1: script indexation.sh (in cytatlas/Scripts)
Used for re-indexation after new files or new data
http://atlasgeneticsoncology.org/Collab/Scripts/indexation.sh
1. Generation of all automatic genes
2. Generation of the main index file for all documents (ObjDB.txt)
3. Generation of some others indexes (ObjDBxx.txt ) (see in http://atlasgeneticsoncology.org/Collab/)
- ObjDB.txt
- ObjDB0.txt
- ObjDB1.html
- ObjDB1.txt
- ObjDB2.txt
- ObjDB3.txt
- ObjDB4.txt
- ObjDB5.txt
- ObjDB6.txt
- ObjDB7.txt
4. Generation of a catalog (text file with the information from the HEADER, see: http://atlasgeneticsoncology.org/Collab/catalog)
5. Transformation of the catalog (and "for sale" - "to be written" files) in tables with concatenation in a catalog_full.txt file)
6. Indexations of Genes (Geneliste.html), Leukemias (Anomliste.html), etc.
7. Indexation by chromosomes
8. Indexation by authors (different IndxAuthxx.txt / html in Collab) (IndxAuth3.txt is the main index for authors and affiliations)
9. Generation of Categories (several files are maintained before in parallel) for Cell Biology items
10. Generation of status (Genes .. Authors . etc.): http://atlasgeneticsoncology.org/Status/Status.html
11. Generation of Recent (last 2 years documents): http://atlasgeneticsoncology.org/Recent.html
12. Generation of COSMIC projects and TCGA/ICGC projets
13. Statistics (http://atlasgeneticsoncology.org/stat_atlas.html)
Possibility of mysql indexation for some items (query in the home page)
Module 4: Internal cross links between classes of cards
This section is important and gives a plus at the Atlas with numerous additions of links between cards. These links (Gene -> tumors, Leukemia <- genes …) enrich the quality and the expertise of the Atlas.
An automatic procedure parses all the cards and indexes the location of links (defined as a standard by the pattern ( <: TXT: text ID: AtlasID).
http://atlasgeneticsoncology.org/Collab/Scripts/maj_full.sh
This script has 2 goals: definition of all internal links
and
Generation of all hypertext files (readable on the net)
Map of one set towards another: injectivity/surjectivity:
Item |
Internal hyperlink toward |
1 Gene |
n1 Leukemias |
n2 Solid tumors |
|
n3 Cancer-prone |
|
1 Leukemia |
n4 Genes |
n5 Cancer-prone |
|
1 Solid tumor |
n6 Genes |
n7 Cancer-prone |
|
1 Cancer-prone |
n8 Genes |
n9 Leukemias |
Examples:
Item |
Hyperlinks toward |
||
Gene |
Leukemia |
||
Leukemia |
|||
Leukemia |
|||
Solid Tumor |
Lung Adenocar. t(9;9)(q34;q34) PRRC2B/NUP214 |
||
Gene |
Leukemia |
||
Solid Tumor |
|||
Cancer Prone |
|||
Leukemia |
Gene |
||
Gene |
|||
Cancer Prone |
Gene |
||
Gene |
|||
Solid Tumor |
|||
Solid Tumor |
Module 5: Management of external links
Files for annotation of genes:
A parallel management of several databases is regularly made (from NCBI, UCSC, UniProt, Ensembl, HGNC, COSMIC, Mitelman (NCI) …)
All genes defined in the Atlas are based of the update list of gene symbols of human Entrez_genes . A great part of annotations are also associated ones in the ftp files of (ftp.ncbi.nih.gov/gene/DATA/ and /gene/GeneRIF). They are managed by script to lead in 2 tabulated files (genes_gc.txt and genes_gn.txt ) , the first one for « cancer genes » (some words in description and/or GeneRif in Entrez Gene),
the second for other genes of Entrez_Gene (NCBI). Genes used are limited to a genome location (hg38) defined in http://hgdownload.soe.ucsc.edu/goldenPath/hg38/database/refGene.txt.gz.
See: for a definition of all the fields keep updated and used in External links of the cards on Genes: External_annotations_genes
Module 6: Statistics
- Topics/Items for the Atlas (genes, leukemias, solid tumors and others):
- Need of Tables (e.g. http://atlasgeneticsoncology.org/Collab/ID-TRANSLOC.txt ) to know what is done, what is done but old, what is reserved to a given author, what needs an author to be found. Allows the addition of new items.