USERS of the

Atlas of Genetics and Cytogenetics in Oncology and Haematology

http://AtlasGeneticsOncology.org

 

There are many random visits, as is usual with internet queries; on the other hand we are aware that many professionals (such as cytogeneticists) prefer to enter a keyword directly on Google, such as "t(9;22)(q34;q11)" than passing through the home page of the Atlas, which introduces a bias the other way: a query from a robot may also come from a frequent/professional user.

 

However:

1.       There are regular users, from University and/or Hospital cities.

2.       Many users are from academic sites.

3.       A number of private companies also use the Atlas.

4.       The Atlas has an international audience, not only in rich countries, but also in emerging countries.

 

- 5,500 unique visits every day

- 1.5 million visits a year

- 25% from the USA,

- but also frequent use from emerging countries

 

Figure 1 Regular users are our real core target (Cytogeneticists, Hematologists, Researchers, University Teachers).

 

Figure 2 Geographical Origin: dominance of the USA


 

Figure 3 Consultation in North America centered on university and/or medical areas: New York, Houston, Chicago, Los Angeles, Boston, San Diego, Salt Lake City, Montréal, Philadelphia, Toronto

 

Figure 4 Consultation in Western Europe:Paris, Lyon, Munich, Toulouse, Brussels, Lille, Berlin, Vienna, Montpellier, Zurich, Nantes …

 

Figure 5 Consultation in Northern Europe: London, Dublin, Glasgow, Sheffield, Edinburgh, Oxford, Helsinki, Stockholm, Cambridge, Copenhagen …

 

Figure 6 Consultation in Southern Europe: Madrid, Milano, Roma, Barcelona, Napoli, Valencia, Seville, Athens, Turin, Lisbon …


 

Figure 7 Consultation in Southern America: Bogota, Santiago, Buenos-Aires, Quito, La Victoria, Medellin, Caracas, Cali, Sao Paulo, Barranquilla …


 

Figure 8 Consultation in South Asia: New Delhi, Bengaluru, Mumbai, Chennai, Hyderabad, Kolkata, Pune, Tehran, Lahore, Karachi …


 

Figure 9 Consultation in Eastern Asia:  Seoul, Hong Kong, Beijing, Shanghai, Wuhan, Gyeonggi-do, Osaka, Busan, Minato, KyotoQuezon, Bangkok, Singapore, Kuala Lumpur, Selangor, Manila, Makati, Cebu City, Ho Chi Minh, Jakarta …


 

Figure 10 Consultation in Africa: Tunis, Casablanca, Cairo, Algiers, Lagos, Cape Town, Rabat, Dakar, Alexandria, Johannesburg …

 

Figure 11 Consultation in Australasia: Sydney, Melbourne, Brisbane, Perth, Adelaide, Auckland, Canberra, Wellington, Newcastle, Christchurch …

 

 

 

 

Project title

 

'Atlas of Genetics and Cytogenetics in Oncology and Haematology'

internet journal / encyclopaedia /database http://AtlasGeneticsOncology.org

 

 

Summary

 

The Atlas is a peer reviewed internet journal / encyclopaedia / database devoted to:

- genes implicated in cancer,

- cytogenetic or clinical entities in cancer, and

- hereditary diseases which are cancer-prone conditions.

Size of the Atlas in 2015: number of pages: 45,500; 3,500 authors; 5,500 visits every open day; 1,1 million unique user every year (USA 27%).

 

 

Key words

 

- Pooling of knowledge concerning the biology of normal and cancerous cells,

- Referential and investigation tool for research,

- Multilingual pedagogical support,

- Translational health research

- Transfer of scientific innovation towards research itself, and, downstream, towards patient care

 


Context

 

25,000 new publications concerning cancer genetics in man are added each year in PubMed. No one, anymore, has the whole required knowledge, necessary to guide the treatment procedure in case of a rare disease. Huge database are therefore needed, to collect and summarize data on these rare diseases, and produce meta-analyses.

 


Objectifves of the project


- Medical treatment assistance in rare forms of cancer,

- Efficiency savings in the fight against cancer,

- Decrease in fundamental and applied research as well as medical costs.

- Personalised medicine for cancer (one of the axes of the cancer plans).



I- Scientific Background

General description / relevance and originality of the project Prognosis of a leukaemia depends on the genes involved: 5 years survival: 6% in the inv(3)(q21q26) RPN1/MECOM leukemia, 100% in the  dic(9;12)(p13;p13) PAX5/ETV6 leukemia. And treatments, indeed, depend on the severity of the disease. However, 2,000 to 9,000 genes are possibly implicated in cancer, and 1,200 types of solid tumours exist. Some cancers are frequent while many others are very rare (only 1 published case), this is particularly true for leukemias subtypes … but there are more than 900 leukemias!No one, anymore, has the whole required knowledge, necessary to guide the treatment procedure in case of a rare disease. Huge database are therefore needed, to collect and summarize data on these rare diseases, and produce meta-analyses. L’ Atlas contributes to 'meta-medicine', this mediation between the knowledge and the knowledge users in medicine.

II- International positioning of our team / Current organization

The Atlas is one of the pioneers of the Internet, it started in 1997. It meant writing short cards on genes and chromosomes implicated in cancer. The Atlas is now 45,500 pages big, written by more than 3,500 authors, from about 50 countries (France, USA, Italy, United Kingdom, Germany, Japan, Spain, Canada, China, The Netherlands ...).

The Atlas is in free access, which is particularly useful for third world countries or for students.

- Topic research The Atlas participates in research on cancer epidemiology. The Atlas is a tool in genomic studies of the latest generation (see Science 28 Jan 2011, vol 331 p 435-439, suppl data, where the authors indicate that they have used the Atlas and Cosmic as reference databases to choose and test a series of genes in medulloblastoma in childhood). Nonetheless, the Atlas' aim is not to overlap with the databases daily used in molecular biology (UCSC, Ensembl, Entrez, Cosmic, Swiss-Prot). However, as these data are in the public domain, we could easily produce them, according to the needs of the community.

- Topic teaching and continuing medical education The Atlas is at the crossroads of research, university and post-university teaching (virtual medical university) and telemedicine. 

- Database on genes: Where else can be found so many detailed articles on genes? see: PTN or BCL6 http://atlasgeneticsoncology.org//Genes/PTNID41904ch7q33.html and http://atlasgeneticsoncology.org/Genes/BCL6ID20.html.

- Electronic journal version of the Atlas An Open access electronic journal /pdf version of the Atlas is being developed by Institute for Scientific and Technical Information (INIST) of the French National Centre for Scientific Research (CNRS). Available are the archives of a quarterly journal since 1997, which became a bimonthly journal in 2008 and a monthly journal in 2009, comprising 2,500 articles in more than 120 volumes, which constitutes a 10,000 pages collection, available at: http://irevues.inist.fr/atlasgeneticsoncology, allowing the Atlas, soon, to be referenced by the main bio-medical databases, including PubMed.

- The Atlas combines various types of knowledge all on one site: Genes and their function, cell biology, diseases, cytogenetics, but also clinical genetics, including hereditary diseases which are cancer-prone conditions. This tends to unify cancer genetics, while data are elsewhere dispersed between several sites[1]. The iconography in the Atlas is diverse (medical imaging, pathology, chromosomes, 3-D structure of proteins, genetic maps...), which is not found in other sites (apart from genetic maps that can be found in GeneBank, Ensembl …). The Atlas is the only site devoted to genetics where the prognosis is quoted:  It has always seemed surprising to us that such a crucial data is just ignored in other sites. There are more than 17,000 internal hyperlinks in the Atlas.

- Diagnosis assistance and information-based therapeutic decision The Atlas contributes to the cytogenetic diagnosis and may guide treatment decision making, particularly regarding rare diseases (because they are numerous, rare diseases are frequently encountered).

- Developments in cell biology and physio-pathology: data in the Atlas are a definite resource in cell biology and physio-pathology, that we are just beginning to harness (e.g.: Apoptosis: http://atlasgeneticsoncology.org/Categories/Apoptosis.html ).

- Towards a personalized medicine of  cancer: From our section "Genes", can be extracted 613 genes implicated in colorectal cancer, 750 in breast cancer, and 494 genes in prostate cancer (see paragraph "Other genes implicated " at: http://atlasgeneticsoncology.org/Tumors/breastID5018#EXTRACTED). With the fast development of technics in genetics, it now emerges that many subtypes of solid tumors may exist, following the leukemia model (how many hundreds of breast cancer subtypes, defined by distinct genetic profiles, to be uncovered?). Recently, new data on lung adenocarcinoma made possible to consider personalized medicine (see http://atlasgeneticsoncology.org/Tumors/TranslocLungAdenocarcID6751.html ). This, together with cell biology developments in the Atlas demonstrates that the encyclopedic content is potentially a basis for developing personalized medicine for cancer. It remains to present this knowledge in such an affordable manner that the Atlas truly becomes a tool for the staff meetings. 

Renown Prestigious journals, such as Science and Nature Reviews Cancer, have written about the Atlas The Atlas is regularly cited as a journal in leading scientific publications (Annual Review of Biochemistry (Impact Factor 29,88), Science (29,75), Nature Reviews Cancer (29,54), Cancer Cell (25,29), Nature Cell Biology (19,53), Journal of the National Cancer Institute (14,07), American Journal of Human Genetics (12,30), Molecular Systems Biology (12,13), Genes and Development (12,08), Genome Research (11,34), Trends in Molecular Medicine (11,05), Blood (10,56), PNAS (9,43)...). The Atlas is also cited as a reference database in Science. Hyperlinks towards the Atlas have been developed by the National Cancer Institute USA, Swiss-Prot, GeneCards, the Sanger Institute, Mitelman database au NCI, etc... Renown of the Atlas may be checked at: http://chromosomesincancer.org/en/jce/acknoledgements-to-the-atlas.html (Felix Mitelman wrote that the Atlas "has grown into a truly monumental encyclopaedic work of great importance to cancer research (...) an invaluable reference and resource for scientists and clinicians").

Who uses the Atlas?The Atlas is accessed by: 1- Research: cytogenetics, molecular biology, cell biology researchers; 2- Hospital: clinicians, haematologists, cytogeneticists, pathologists, from the teaching hospitals, indeed, but also from general hospitals where the Atlas is one of the rare free resources. Junior doctors in haematology or oncology, are also most receptive to the Atlas that they see as a training and educational tool; 3-Students.

The Atlas receives more than 1,100,000 visits every year (25% of the visitors are from the USA, 12% from France, but many other visitors are from the developing countries. 200 cytogenetic labs come every day, 1,000 visitors more than once a week (cytogenetic labs./university teachers-researchers), and 6,000 visitors more than once a month (university teachers - researchers, students, staff meetings on blood malignancies). 5,500 individual machines connect to us every day. A bug, on May the 23rd, 2008, was devastating: pages on chromosomes, with their data, had vanished; many mails were sent to us, proving that the Atlas was (is) indispensable; see: http://chromosomesincancer.org/en/jce/acknoledgements-to-the-atlas.html#Incident ). See also recent testimonies at :http://atlasgeneticsoncology.org/Supporting_Atlas_First_signatories.pdf

The Atlas is also operated by various biotech and pharma to implement their internal systems and databases.

Comparison with the Mitelman Databasethe Atlas is certainly not redundant with the "Mitelman". As a matter of fact, the Mitelman and the Atlas are complementary: the Mitelman is exhaustive but gives rough data, with no annotation, while the Atlas presents a meta-analysis of the data, giving an overview on a given disease, describing the main clinical characteristics, with, when possible, an iconography of chromosomes (e.g. see: t(1;11)(p32;q23) in each of the two sites).

Scientific societies have decided to grant the Atlas, they thus give a scientific credibility and a clear and real will to the Atlas sustainability and perennialty: Association des Cytogénéticiens de Langue Française et Groupe Francophone de Cytogénétique Hématologique, Belgian Society of Human Genetics et Belgian Cytogenetics Group in Haematology and Oncology, Dutch working group of Tumor Cytogenetics, Berufsverband Deutscher Humangenetiker e.V. et Gesellschaft für Humangenetik e.V., Societa Italiana di Genetica Umana, Grupo Cooperativo Español de Citogenética hematológica and Sociedad Española de Hematología y Hemoterapia, Australasian Society of Cytogenetics. 

Structure of the board of the Atlas

- Jean-Loup Huret, associate professor and consultant, CHU Poitiers, editor in chief and Philippe Dessen, research director, CNRS-Institut Gustave Roussy, database director;

- Because, historically, the project could not have been supported by institutional players, a non-profit association (french law 1901) was created (see http://chromosomesincancer.org/en/ ) (president Jean-Loup Huret, geneticist, vice president Hossein Mossafa, geneticist, treasurer Bernard Drochon, external auditor, secretary Martine Jammet, entrepreneur, all volunteers). This association have employed 5 to 6 MSc or PhD co-workers (the main area of expenditure: 30,000 Euro (salary + charges) for each collaborator, but they make an indispensable work -since the Atlas is much more an encyclopedia than a database, requesting a good amount of expertise). The association comprises about 60 members (50% french, 25% americans).

- A scientific committee of the association (http://chromosomesincancer.org/fr/jce/conseil-scientifique.html ), including delegates from various scientific societies; and there is also an editorial board for the Atlas (http://atlasgeneticsoncology.org/Backpage.html#EDITORIAL ).

III- Development steps

To reach its principal object -the patient service-, the Atlas must become "clinic orientated". To encourage the clinical physician to use the Atlas in his decision making, the Atlas must provide the required information, and such in a "clinician friendly" manner.

A part of our activity must be dedicated to bioinformatics developments, at a time when new fields of cytogenetic are developing with the massive use of FISH, CGH, exome and genome sequencing. Numerous and precise data on mutations, structures variations as well as on gene fusions are taking over the literature.

Adding information with a clinical orientation and into a customized therapeutic care. Developing the knowledge and the scientific innovation transfer into the care system, in particular as part of targeted therapies. Developing decision trees (using the High Authorities recommendations for each pathology); making a list of all the examinations to be performed according to each step of the



 

 

Editorial workflow in the Atlas

The Atlas editorial team stands in need of Section Editors (see http://atlasgeneticsoncology.org/BackpageAbout.html#EDITORIAL ) devoting 10% of their time for the Atlas.

The workflow detailed herein below applies when MSc students/workers are involved in the process. Indeed, a senior researcher will skim through the various steps described below. However a secretary's active watch on forthcoming papers from the various authors is needed (like an appointment calendar). Moreover, when the Atlas has evolved into a true database with an automated editorial process, many of these time consuming tasks will vanish.


I- Subject-by-subject expertise

For each of the themes included in the atlas (genes, leukemias, solid tumors, hereditary diseases involving increased risk of cancer, "deep insight" …), the editorial team is tasked with appraising the subjects (or "items") lending themselves to publication on the site.

As regards genes, for instance, using  a predefined list of the genes potentially implicated in cancer (a list determined by algorithms for different data bases by Philippe Dessen, the data base manager), the editorial team is initially tasked with verifying the implication (or non-implication) of these genes in carcinogenesis. With this in mind, they study the existing bibliography, primarily on the basis of the information contained in PubMed. They then endeavor to determine whether or not the published data suffice to justify a detailed written review.  

As regards the other themes (leukemias, solid tumors, hereditary diseases involving increased risk of cancer...), the bibliographic research is also carried out from PubMed, and also on specialized sites (Mitelman, Orphanet, OMIM, WHO, …). This research phase favors subjects for which enough recent publications are available to allow for high-quality reviews.

Finally, the editorial team decides whether or not a given subject will constitute an item justifying publication. After that, it is a matter of searching for and selecting the author in the best position to write this review.


II- Searching for the right author

Once a subject has been determined, the editorial team conducts a search of recent publications in view of choosing the author/the team in the best position to write something for the Atlas: their second appraisal. Concerning their choice, the following factors have got to be taken into account: number of publications by the author, the impact factor of his or her publications, and the author's status: clinician, hospital-based biologist, researcher. The author's disciplines are likewise taken into consideration: oncology, a specific organ, cytogenetic biology, molecular biology, cellular biology, physiopathology, etc. As regards teams, a multidisciplinary team will be preferred.

This is a highly sensitive step. According to the author chosen, the quality of the review is likely to vary, and its scientific orientation will be more or less fundamental or clinical.

Other selection criteria include: qualities of clarity in data presentation, synthesis and the use of schemas or medical imagery (in the broad sense of the word, including anatomopathology and chromosomes).
 

III- Managing the contact list spreadsheets - Managing the spreadsheet identifiers

Along with the different items (genes, leukemias, solid tumors...), all important data on the authors are conserved in dedicated files.

And for each gene, the spreadsheets inventory information on its implication, as well as all other information deemed useful for future research (authors to be contacted, their contact information, e-mail exchanges, agreement to write an article, speed in production, and subjects to be reserved for a given author...). Totally indispensable, the spreadsheets are of concrete use during each phase of the work. They need to be ergonomic, and evolve according to changing needs. 

For each theme in the Atlas (Genes, Leukemias, Solid Tumors, Hereditary Diseases involving increased risk of cancer, "deep Insights", Teaching Chapters), there exists a dedicated spreadsheet.

Each spreadsheet provides a list of the subjects studied, the relevant dates, as well as the subjects to be studied, for which the right authors remain to be found.

The spreadsheets are updated daily, and new, potentially interesting subjects are added periodically. This supplementary material is enriched by readings of articles from the literature and by analysis of the articles received by the Atlas.   

Each subject is designated by a specific and numbered identifier facilitating the establishment of internal links as the files are being edited.

Possible duplicate files are regularly sought out and subjected to analysis, and the spreadsheets are consequently "cleaned".  

For an example, see below: Extracts from the "Genes" spreadsheet, which contains 9000 lines.


HOXA9 (7p15)

61

07p15

HOXA9

homeo box A9

t(7;11)/ANLL

done

PAX5 (9p13)

62

09p13

PAX5

paired box gene 5 (B-cell lineage specific activator protein)

t(9;14)/NHL

done

NUP98 (11p15)

63

11p15

NUP98

nucleoporin 98kDa

t(7;11)/ANLL

done

PICALM / CALM (11q21)

64

11q21

PICALM

phosphatidylinositol binding clathrin assembly protein

t(10;11)/ANLL

done

DDX6 / RCK / LPC (11q23)

65

11q23

DDX6

DEAD (Asp-Glu-Ala-Asp) box polypeptide 6

t(11;14)/NHL PMID 1394235

FOR SALE

TCL1A / TCL1 (14q32)

66

14q32

TCL1A

T-cell leukemia/lymphoma 1A

inv(14)ort(14;14)ort(X;14)/T-cell

done

BCL3 (19q13)

67

19q13

BCL3

B-cell CLL/lymphoma 3

t(14;19)/NHL/ChrLympho

Reserved

PAX7 (1p36.13)

68

01p36

PAX7

paired box gene 7

t(1;13)/rhabdo

FOR SALE

PRCC (1q21)

69

01q21

PRCC

papillary renal cell carcinoma (translocation-associated)

t(X;1)(p11;q21)renal carcinoma

done

PAX3 (2q35)

70

02q35

PAX3

paired box gene 3 (Waardenburg syndrome 1)

t(2;13)/rhabdo

done

CTNNB1 (3p21)

71

03p21

CTNNB1

catenin (cadherin-associated protein), beta 1, 88kDa

t(3;8)/adenoma PMID: 23490077, 24042511, 25124581

done

LPP (3q27)

72

03q27

LPP

LIM domain containing preferred translocation partner in lipoma

t(3;12)/lipoma

done

ETV1 (7p22)

73

07p22

ETV1

ets variant gene 1

t(7;22)/Ewing

done

PLAG1 (8q12)

74

08q12

PLAG1

pleiomorphic adenoma gene 1

t(3;8)/adenoma

done

NR4A3 / TEC / CHN (9q22)

75

09q22

NR4A3

nuclear receptor subfamily 4, group A, member 3

Extraskeletal myxoid chondrosarcoma with t(9;22)(q22;q12) or t(9;17)(q22;q11) or t(9;15)(q22;q21)

done

RET (10q11)

76

10q11

RET

ret proto-oncogene (multiple endocrine neoplasia and medullary thyroid carcinoma 1, Hirschsprung disease)

inv(10)/adenocar thyroid

done

AKAP10 / RIa (17q23-24)

77

17q23

AKAP10

A kinase (PRKA) anchor protein 10

thyroid

FOR SALE

WT1 (11p13)

78

11p13

WT1

Wilms tumor 1

t(11;22)/desmoplastic

done

FLI1 (11q24)

79

11q24

FLI1

Friend leukemia virus integration 1

t(11;22)/Ewing/PNET

done

DDIT3 / CHOP (12q13)

80

12q13

DDIT3

DNA-damage-inducible transcript 3

t(12;16)/liposarc

done

ATF1(12q13)

81

12q13

ATF1

activating transcription factor 1

t(12;22)/clear cell sarc

done

 

IV- Contacting the author - Managing author responses -

The authors are contacted through a common e-mail account using predetermined contact e-mail coordinates.

The e-mails are sent once a week, at the beginning of the week, as it has been observed that the highest response rate occurs at that time. On the same token, some times of the year are not conducive to soliciting authors who tend to be busy with applications for grants ... or on holiday. These periods are avoided as much as possible.

When an author fails to answer, he is re-contacted two or three times. If, after that, he still fails to answer, another author is sought out...

Author response statistics: 55% of the authors do not respond to the invitation; 25% of them refuse; 20% agree.

When authors refuse or do not respond, the process of searching for the right author is reinitiated. If no other author can or will write the requested review, the team will wait for new research to be published..

When, on the other hand, an author accepts, the team consults him on the time frame he thinks he shall need prior to sending his completed article. The publishing flow is thereby managed, both for the Internet site and the PDF journal. A model of writing for the review is given the author, who concomitantly receives relevant editorial information.

Quite frequently, articles fail to arrive within the allotted time. The status of ongoing articles has got to be regularly monitored, and reminder notes need to be sent. 

It also often happens that notwithstanding an initially positive response, authors do not respond to reminder notes, or else they withdraw from the project. In those cases, search for the right author resumes.

Assistance for authors: Quite frequently, authors ask questions (deadlines , explanations pertaining to various details...) that call for timely responses.

 

V- Receipt of the worksheets - Flow management

 Once the article has been received, the completed worksheet is registered on a common server.

The information is also recorded on common spreadsheets, in a common notebook dedicated to the editors' ongoing activities; it is also displayed on wall calendars providing overall perspective on publication flows.

With these calendars, the editorial team can rapidly apprise itself of a possible lack of articles for the PDF journals and try to optimally manage the flow of arriving articles. Unfortunately, given the fact that the authors are the ones who determine their sending dates, substantial variations in publication flow may come to exist, and not be easy to manage.

 

VI- Verification of the articles and assessment by referees (referring process)

The articles received are reread by internal referees; 97% of them (in fact all the articles, except for clinical case reports) are reviews of the literature written at the request of the Atlas editorial board; they are known as "commissioned papers". It should be noted that the authors were preselected according to rigorously applied criteria preliminarily to their writing of an article. Following receipt, the process of rereading and assessment of articles is implemented by the Atlas editorial board. It should also be noted that as the process draws to a close, the board member having chosen the subject (and subsequently the author himself) possesses a precise overall vision as to what the article is supposed to produce (quantity of knowledge in molecular biology or clinical research on a given gene...).     

Articles that do not correspond to the publication criteria or that do not provide a sufficiently detailed report on the knowledge available on the subject are sent back for revision. The editorial team remains vigilant, and about 20% of the articles submitted are indeed sent back for revision. It is often necessary to send reminders to the authors, before receiving the revised version (loss of time).  

The clinical case studies in hematology (the remaining 3%) are subjected to an external assessment process involving 4 or 5 experts, while the final decision is made by the Atlas editor-in-chief. Practically all the clinical case studies are reviewed by the authors at least one time prior to publication; only 8% are immediately accepted, 68% are accepted following revision, and 24% of the articles having been submitted are finally refused.
 

VII- Publication of the articles - html coding  

Once the articles have been accepted, they are processed so that they can be placed on line by the data base manager.

The articles are to be formatted according to a strictly applied computational model in order to be properly handled by the scripts of the data base manager (see, as an example, the form below).

The text must be painstakingly reread so as to correct possible mistakes (typos, grammar, spelling...),   and each bibliographical reference cited in the next has got to be verified and corrected (if need be).

The article header (the different tags or tracking devices allowing for indexing of the form in different parts of the data base in accordance with different criteria) must be carefully filled out, using the metadata contained in the article. The metadata will permit the data base manager to integrate the form within the site so that it can be inventoried with regard to the dedicated lists.  

The articles in the Atlas must follow a predetermined model; each paragraph of the review must be included within the tags, and the text must be formatted according to a computer encoding system. Special characters (accent marks, Greek letters, symbols…ex: integrin ανβ3) must be coded in the HTML format so that their posting on the website be correct. 

In the text itself, all potential internal links to other items contained in the Atlas (genes, leukemias, solid tumors...) must be identified, and internal links created in such a way that readers are directed, if they wish, to other articles in the Atlas. New subjects for the Atlas are often found while the forms are being processed, and added to the shared spreadsheets.  

Most often, the bibliography contains 50 to 100 references, and it has got to be processed in a predetermined format.

The articles also contain images, which are processed by Photoshop software. More often than not, the size of the images requires rectification, and the background of the image has got to be rendered transparent. Moreover, the text may need rewriting when it is too blurry or has been degraded due to the change in size.  

Once processing of the worksheet has been completed, it is put through a simulator so as to verify the correctness of its formatting, and also so as to make sure that no component remains invisible, as happens when a closing tag such as > is forgotten. 

The file and the attached images are then sent for validation to Jean-Loup Huret, the editor-in-chief, and subsequently to the data base manager for uploading or posting on line.  

"Deep Insight" and "Educational items" constitute special cases. As these files are "free format" and do not correspond to a precise model, they are directly processed using specialized software (the Dreamweaver website-editing software) in the HTML format. For these files, it is necessary to carry out all coding and page layout in HTML.

Example below : Partial formatting of an article: In the example given below, any and all parts of the text in color are subjected to special processing.

BEGIN_HEADER

FILENAMESH3PXD2AID45995ch10q24.txt

CLASSE GENE

ID45995

LOCUSID

TRI_PAR_CHROMOSOME10

TRANSLOC t(10;10)(q24;q24) SH3PXD2A/OBFC1

TRANSLOC t(10;13)(q24;q14) SH3PXD2A/RB1

FUSION_GENESH3PXD2A/OBFC1

FUSION_GENESH3PXD2A/RB1

CATEGORYCytoskeleton

END_HEADER

etc..

 

etc...

FUNCTIONTKS5 was initially identified as a substrate for <CC: TXT:SRC ID: 448> (Lock et al., 1998), and was subsequently shown to play a critical role in invadosome formation in multiple cell types (Courtneidge, 2011; Murphy and Courtneidge, 2011; Paz et al., 2013). <BR>

FUNCTIONFull-length TKS5 functions as an adaptor for recruiting other proteins to the cell membrane for invadosome formation. The recruitment of TKS5 to the cell membrane depends on its PX domain and phosphorylation by Src (Abram et al., 2003). It has been proposed that phosphorylation of TKS5 releases its PX domain from intramolecular interaction and allows TKS5 to bind to cell membrane phosphatidylinositol lipids, such as phosphatidylinositol-3,4-bisphosphate (PI(3,4)P2) (Abram et al., 2003; Oikawa et al., 2008). At the cell membrane, TKS5 is thought to interact with multiple components of invadosomes either directly or indirectly, and thereby mediates invadosome formation and maturation (Sharma et al., 2013). These interacting partners includes adaptor proteins and actin regulatory proteins, such as <CC: TXT:NCK1 ID:41505>, <CC: TXT:NCK2 ID:52582>, <CC: TXT:GRB2 ID: 386>, <CC: TXT: CTTN ID: 369> (Cortactin), <CC: TXT: WASL ID: 42803> (N-WASP), <CC: TXT: ACTR2ID: 49744<CC: TXT: ACTR3ID:46303> (Arp2/3) complex, and <CC: TXT: ARHGAP35 ID: 52237> (p190RhoGAP) (Crimaldi et al., 2009; Oikawa et al., 2008; Stylli et al., 2009).<BR>

FUNCTION TKS5 also interacts with <CC: TXT:NOXA1 ID:41563> and <CC: TXT: CYBA ID: 43882> (p22phox), which are components of the NADPH oxidase complex, and thereby promotes reactive oxygen species (ROS) production by NOX enzymes at invadosomes (Diaz et al., 2009; Gianni et al., 2010; 2009).ROS have been shown to facilitate invadosome formation by maintaining or amplifying the phosphorylation of TKS5. As such, TKS5 is thought to promote invadosome formation via ROS in a positive feedback loop.<BR>

FUNCTION Finally, TKS5 has also been shown to interact with members of the ADAM family metalloproteases, specifically <CC: TXT:ADAM12 ID:44084>, <CC: TXT:ADAM15 ID:46345>, <CC: TXT:ADAM19 ID:46837>(Abram et al., 2003). It is believed that Tks5 recruits theses proteases to the invadosome foci for processing growth factors and regulating cell motility. For example, ADAM12 has been shown to promote ectodomain shedding of <CC: TXT:HBEGF ID: 40369> (heparin-binding EGF-like growth factor) and enhance invadopodia formation in cancer cells (Diaz et al., 2013). <BR>

etc...

 

VIII- Posting (uploading) and verification

Article upload takes place virtually every week, and for each article, it has to be verified.   Also to be verified: Has the article been correctly indexed in  the dedicated lists? Has the article been correctly displayed? .


IX- Author acknowledgment

Before a thank-you e-mail is sent to the authors of a review, their bibliographies are searched so as to see if they could possibly be tasked with another review. If this is the case, they are asked whether or not they would be interested in pursuing their collaboration.   

The author is given the address at which he can find his review on the Atlas site.

 

X- Creation and management of the PDF journal  

http://documents.irevues.inist.fr/handle/2042/15655

Prior to each issue, the editor in charge of the journal is tasked with creating the table of contents and scrupulously selecting the articles to be published. This is because publishing workflow is highly fluctuating, and consequently demands "hands-on" management.

A tool for the future data base carrying out initial formatting of the articles and editing the table of contents provides assistance in the page layout of the PDF journal.

In each issue, page layout is a sensitive matter. It is performed article by article, and has got to be done in such a way that a given issue is harmonious and satisfactorily balanced; moreover, image management must ensure maximal quality.

Once the journal is formatted and paginated, it is validated by the editor-in-chief.

The issue is then sent to the INIST-CNRS for finalizing of the PDF. The INIST editors recover the meta-data for the articles and associate them with the issue. They are also tasked with assigning the DOI (digital object identifier) for each article that will serve as its single identification number.  

For back-up purposes, this final PDF is registered on the future data base.

Once all of the above steps have been carried out, the authors are contacted anew and informed of the publication of their articles in the journal and the link to the latter in PDF with which they are being provided.  

 

Table of working time distribution for each Atlas editor (in percentage, for full-time service: 35h/week  

Task

% Equivalent full time

Subject expertise

94

Search for authors

30

Contact list spreadsheet management

42

Contacting authors

47

Managing author responses

39

Managing the spreadsheet identifiers

16

Receipt of the worksheet forms

5

Verification and assessment of the articles

tio

Publication of the articles

125

Uploading and verification of the articles

15

Addressing acknowledgements to the authors

15

Integration of the new site (BDD)

10

Creation and management of a PDF journal

10

Management and development of the new site (BDD)

10

Management of association members and donations

5

Management and maintenance of the promotional site

1

Maintenance and development of the current site

9

Miscellaneous

10

TOTAL

500

 The above percentages are subject to modification according to editorial needs and the fluctuating importance of certain tasks.

Again, we want to make it clear that the workflow detailed herein applies when MSc students/workers are involved in the process. Indeed, a senior researcher will skim through the various steps. However a secretary's active watch on forcoming papers is needed. Moreover, when the Atlas has evolved into a true database with an automated editorial process, many of these time consuming tasks will vanish.

The Atlas can run with:

- an Editor in Chief (20% full-time) + secretary (2-5%).

- Section Editors (5- 10% full-time) + secretary (1%).

- a Bio-computer specialist (10% full time).

- .. and as many good authors as possible !

 

 

 

 

Added value of the Atlas

 

Encyclopedia containing original monographs written by different authors, the Atlas combines different types of knowledge. No other web site presents so many monographs on genes, very rich iconography. It is a tool for researchers in genomics, for clinicians as a help in diagnosis and therapeutic decision, for a personalized cancer medicine. The Atlas is not only a web site and an encyclopedia, but also a scientific journal.

In conclusion, it is an original/unique database without equivalent.

 

Positive aspects of the Atlas

 

Encyclopedia composed of documents that are original monographs written by invited authors, based on their expertise in a given field (ex: Carlo M Croce on "Common fragile sites and genomic instability", Rolf Marschalek on the KMT2A (MLL) gene, George C Prendergast on the IDO1 and 2 genes, Cécile Badoual on "Head and Neck paragangliomas", Maurizio Genuardi on "MUTYH-associated polyposis", Felix Mitelman on "Cancer Cytogenetics"). Other databases present texts on genes (for example Aceview) but these are non-expertized texts, resulting from data mining. GeneCards is a database with links to other databases, with few data copied from these databases.

 

Starting first from cytogenetics, the Atlas combines different types of knowledge in a single web site: genes and their function, cell biology (ex: Apoptosis:http://atlasgeneticsoncology.org/Categories/Apoptosis.html), pathological data, diseases and their clinical implications, cytogenetics, but also medical genetics, with hereditary disorders associated with an increased risk of cancer. This gives a wider and more global view of cancer genetics, while these data are usually dispersed[1]. The Atlas is the only genetic site where the prognosis is included; it has always surprised us that data so important such as prognosis could be absent in other sites… even more now in the context of personalized cancer medicine.

 

No other site, journal or book has so many monographs on genes (more than 1,400 genes; ex:http://atlasgeneticsoncology.org//Recent.html), together with 28,000 non-annotated cards based on the GeneCard type (ex: http://atlasgeneticsoncology.org//Indexbychrom/idxa_11.html, see paragraph on "Other genes").

 

Very rich iconography on chromosomal aberrations (700 images, ex:http://atlasgeneticsoncology.org/Anomalies/t0609ID1014.html), on genes (3,000 images ex:http://atlasgeneticsoncology.org/Genes/EWSR1ID85.html), on clinics and pathology (400 images, ex:http://atlasgeneticsoncology.org/Tumors/OralMelanomaID6647.html).

 

Research axis: the Atlas is a tool for researchers in genomics (see Science, 2011). Documents on genes having more and more importance in the analysis of an abnormality, mutations and fusions in relation with NGS: the Atlas is a tool allowing an expert selection of genes involved in cancer. It has been referenced several times as a pertinent source in relation with an abnormality, more particularly as identification of genes associated with the abnormality.

 

Help in diagnosis and therapeutic decision: the Atlas guides the cytogeneticist by its iconography on chromosomal abnormalities, informs the clinician on rare pathologies, allowing him to adjust his treatment. The Atlas is particularly indispensable in rare diseases (because rare diseases are numerous, these are frequent). All the cytogeneticists from all around the world regularly use the Atlas, as it appeared with recent financial issues[2].

 

Towards a personalized cancer medicine: 322 genes involved in colon cancer, 457 in breast cancer and 589 in prostate cancer can be extracted from the "gene" section (see paragraph "Other genes implicated"http://atlasgeneticsoncology.org//Tumors/breastID5018.html). With the development of new techniques, it is now evident than solid tumors, as leukemia, have numerous sub-types (how many hundreds of breast cancer with different genetic profiles?). This, added to the development in cell biology, shows that the encyclopedic contents of the Atlas is potentially a basis for the development of personalized cancer medicine.

 

The Atlas is not only a web site and an encyclopedia, but also a scientific journal. The pdf format of the Atlas (http://irevues.inist.fr/atlasgeneticsoncology) constitutes the archives of a journal published 4 times a year since 1997, then 6 times in 2008, and 12 times in 2009. It includes 2,500 articles in more than 100 volumes, and 9,000 pages to allow us to be referenced by PubMed in the future.

 

Ergonomy: Access by pages (ex: chromosomeshttp://atlasgeneticsoncology.org/Indexbychrom/idxa_14.html) , by table of anomalies, genes and fusion genes by chromosomal band (ex:http://atlasgeneticsoncology.org/Bands/11q23.html#GENES), with the more recent data on chromosomal abnormalities and fusion genes compiled (TCGA, Cosmic, TicDB, etc.). Synthetic tables (ex: nosologyhttp://atlasgeneticsoncology.org/Tumors/Solid_Nosology.html, patterns, categories, etc.).

 

The initial structure (independent documents) was improved to allow numerous internal hypertext links: i) direct links and ii) call for links from a given document to a wide range of other documents (ex: tumors associated with a given gene). The Atlas contains more than 17,000 internal links. It is bound to get enriched from its own contents by meta-analysis scripts.

Gestion of external links towards the majority of interesting public databases is automatic.

The Atlas is referenced by other web sites (NextProt, GeneCards, Mitelman database, COSMIC, etc.).

The Atlas benefits from the material support of INIST (CNRS-INSERM), allowing an excellent internet access (more than 1.5 million visits in a year).

 

In conclusion: original database with no equivalent or concurrence, therefore necessary, besides several other databases, for an in-depth analysis (ex: Atlas + Mitelman + Gene at NCBI + Cosmic).

 

… However …

 

Lack of exhaustivity and lack of update of the existing documents (due to lack of staff and the amount of work to be done - the Atlas has already more than 50,000 pages).

Need for the database restructuration, with an editorial system and a more performing ergonomics of consultation.

A need, very soon, for new developments concerning new techniques in genetics.

 

Note: Some addresses

- New developments:http://atlasgeneticsoncology.org/News.html

- Atlas Journal Pdf/Scientific side of the Atlas:http://documents.irevues.inist.fr/handle/2042/15655

e.g. http://documents.irevues.inist.fr/bitstream/handle/2042/56473/vol_19_4_2015.pdf

- Cell Biology: e.g.http://atlasgeneticsoncology.org/Categories/DNA_repair.html

- Cell lines:http://atlasgeneticsoncology.org/cell_lines.html

- Nosology: http://atlasgeneticsoncology.org/Tumors/Solid_Nosology_9.html

- Physical maps of genes, markers, and diseases at the level of the chromosome band: e.g. http://atlasgeneticsoncology.org/Bands/21q22.html

 

 



[1]Given the magnitude of the task, it could be very tempting to limit the Atlas to only a part of it, let's say the leukemias part.

However, it would ruin a specificity/originality of the Atlas: bringing together the diverse and complementary knowledge, to offer a broader view concerning cancer genetics processes. All the more so as a gene can be translocated in a leukemia and overexpressed in a carcinoma, or the opposite, and as the t(16;21)(p11;q22) FUS/ERG may be found in the Ewing?PNET spectrum and in acute myeloid leukemia!.

To all Cytogeneticists, Geneticists, Clinicians, and Scientists for whom the Atlas of Genetics and Cytogenetics in Oncology and Hematology still represents a cultural and practical tool ...

see: http://atlasgeneticsoncology.org//Supporting_Atlas_First_signatories.pdf