Identifying the need for a new LIMS - The University of ...



For Table of Contents Use OnlyLeaf LIMS: A flexible laboratory information management system with a synthetic biology focusThomas Craig, Richard Holland, Rosalinda D’Amore, James R. Johnson, Hannah V. McCue, Anthony West, Valentin Zulkower, Hille Tekotte, Yizhi Cai, Daniel Swan, Robert P Davey, Christiane Hertz-Fowler, Anthony Hall, Mark Caddick Leaf LIMS: A flexible laboratory information management system with a synthetic biology focusThomas Craig1, Richard Holland2, Rosalinda D’Amore3, James R. Johnson1, Hannah V. McCue1, Anthony West4, Valentin Zulkower5, Hille Tekotte5, Yizhi Cai5, Daniel Swan4, Robert P Davey4, Christiane Hertz-Fowler3, Anthony Hall4, Mark Caddick1 1GeneMill, University of Liverpool, L69 7ZB, UK2New Forest Ventures Ltd., Fordingbridge, UK3Centre for Genomic Research, University of Liverpool, L69 7ZB, UK4Earlham Institute, Norwich Research Park Innovation Centre, Colney Ln, Norwich NR4 7UH, UK5Edinburgh Genome Foundry, Michael Swann Building, Max Born Crescent, Edinburgh, EH9 3BF, UK Keywords: Lab Information Management System, Synthetic Biology, Data Management, LIMS, AutomationThis paper presents Leaf LIMS, a flexible laboratory information management system (LIMS) designed to address the complexity of synthetic biology workflows. At the project’s inception there was a lack of a LIMS designed specifically to address synthetic biology processes, with most systems focused on either next generation sequencing or biobanks and clinical sample handling. Leaf LIMS implements integrated project, item and laboratory stock tracking, offering complete sample and construct genealogy, materials and lot tracking, and modular assay data capture. Hence, it enables highly configurable task-based workflows and supports data capture from project inception to completion. As such, in addition to it supporting synthetic biology it is ideal for many laboratory environments with multiple projects and users. The system is deployed as a web application through Docker and is provided under a permissive MIT licence. It is freely available for download at biology takes an engineering focused approach to the manipulation of genetic information within an organism. Functional DNA parts are identified, characterised and converted into a format that allows them to be combined into multiple combinations as functional constructs with predictable properties PEVuZE5vdGU+PENpdGU+PEF1dGhvcj5FbmdsZXI8L0F1dGhvcj48WWVhcj4yMDE0PC9ZZWFyPjxJ

RFRleHQ+QSBnb2xkZW4gZ2F0ZSBtb2R1bGFyIGNsb25pbmcgdG9vbGJveCBmb3IgcGxhbnRzPC9J

RFRleHQ+PERpc3BsYXlUZXh0PjxzdHlsZSBmYWNlPSJpdGFsaWMgc3VwZXJzY3JpcHQiPjEsIDI8

L3N0eWxlPjwvRGlzcGxheVRleHQ+PHJlY29yZD48ZGF0ZXM+PHB1Yi1kYXRlcz48ZGF0ZT5Ob3Y8

L2RhdGU+PC9wdWItZGF0ZXM+PHllYXI+MjAxNDwveWVhcj48L2RhdGVzPjxrZXl3b3Jkcz48a2V5

d29yZD5BZ3JvYmFjdGVyaXVtIHR1bWVmYWNpZW5zPC9rZXl3b3JkPjxrZXl3b3JkPkNsb25pbmcs

IE1vbGVjdWxhcjwva2V5d29yZD48a2V5d29yZD5HZW5ldGljIEVuZ2luZWVyaW5nPC9rZXl3b3Jk

PjxrZXl3b3JkPkdlbmV0aWMgVmVjdG9yczwva2V5d29yZD48a2V5d29yZD5Nb2RlbHMsIEdlbmV0

aWM8L2tleXdvcmQ+PGtleXdvcmQ+U3ludGhldGljIEJpb2xvZ3k8L2tleXdvcmQ+PGtleXdvcmQ+

VG9iYWNjbzwva2V5d29yZD48L2tleXdvcmRzPjx1cmxzPjxyZWxhdGVkLXVybHM+PHVybD5odHRw

czovL3d3dy5uY2JpLm5sbS5uaWguZ292L3B1Ym1lZC8yNDkzMzEyNDwvdXJsPjwvcmVsYXRlZC11

cmxzPjwvdXJscz48aXNibj4yMTYxLTUwNjM8L2lzYm4+PHRpdGxlcz48dGl0bGU+QSBnb2xkZW4g

Z2F0ZSBtb2R1bGFyIGNsb25pbmcgdG9vbGJveCBmb3IgcGxhbnRzPC90aXRsZT48c2Vjb25kYXJ5

LXRpdGxlPkFDUyBTeW50aCBCaW9sPC9zZWNvbmRhcnktdGl0bGU+PC90aXRsZXM+PHBhZ2VzPjgz

OS00MzwvcGFnZXM+PG51bWJlcj4xMTwvbnVtYmVyPjxjb250cmlidXRvcnM+PGF1dGhvcnM+PGF1

dGhvcj5FbmdsZXIsIEMuPC9hdXRob3I+PGF1dGhvcj5Zb3VsZXMsIE0uPC9hdXRob3I+PGF1dGhv

cj5HcnVldHpuZXIsIFIuPC9hdXRob3I+PGF1dGhvcj5FaG5lcnQsIFQuIE0uPC9hdXRob3I+PGF1

dGhvcj5XZXJuZXIsIFMuPC9hdXRob3I+PGF1dGhvcj5Kb25lcywgSi4gRC48L2F1dGhvcj48YXV0

aG9yPlBhdHJvbiwgTi4gSi48L2F1dGhvcj48YXV0aG9yPk1hcmlsbG9ubmV0LCBTLjwvYXV0aG9y

PjwvYXV0aG9ycz48L2NvbnRyaWJ1dG9ycz48ZWRpdGlvbj4yMDE0LzAyLzIwPC9lZGl0aW9uPjxs

YW5ndWFnZT5lbmc8L2xhbmd1YWdlPjxhZGRlZC1kYXRlIGZvcm1hdD0idXRjIj4xNDkwMDk0OTEw

PC9hZGRlZC1kYXRlPjxyZWYtdHlwZSBuYW1lPSJKb3VybmFsIEFydGljbGUiPjE3PC9yZWYtdHlw

ZT48cmVjLW51bWJlcj45PC9yZWMtbnVtYmVyPjxsYXN0LXVwZGF0ZWQtZGF0ZSBmb3JtYXQ9InV0

YyI+MTQ5MDA5NDkxMDwvbGFzdC11cGRhdGVkLWRhdGU+PGFjY2Vzc2lvbi1udW0+MjQ5MzMxMjQ8

L2FjY2Vzc2lvbi1udW0+PGVsZWN0cm9uaWMtcmVzb3VyY2UtbnVtPjEwLjEwMjEvc2I0MDAxNTA0

PC9lbGVjdHJvbmljLXJlc291cmNlLW51bT48dm9sdW1lPjM8L3ZvbHVtZT48L3JlY29yZD48L0Np

dGU+PENpdGU+PEF1dGhvcj5MZWU8L0F1dGhvcj48WWVhcj4yMDE1PC9ZZWFyPjxJRFRleHQ+QSBI

aWdobHkgQ2hhcmFjdGVyaXplZCBZZWFzdCBUb29sa2l0IGZvciBNb2R1bGFyLCBNdWx0aXBhcnQg

QXNzZW1ibHk8L0lEVGV4dD48cmVjb3JkPjxkYXRlcz48cHViLWRhdGVzPjxkYXRlPlNlcDwvZGF0

ZT48L3B1Yi1kYXRlcz48eWVhcj4yMDE1PC95ZWFyPjwvZGF0ZXM+PGtleXdvcmRzPjxrZXl3b3Jk

PkROQSBDb3B5IE51bWJlciBWYXJpYXRpb25zPC9rZXl3b3JkPjxrZXl3b3JkPkROQSwgRnVuZ2Fs

PC9rZXl3b3JkPjxrZXl3b3JkPkVzY2hlcmljaGlhIGNvbGk8L2tleXdvcmQ+PGtleXdvcmQ+R2Vu

ZSBFeHByZXNzaW9uIFJlZ3VsYXRpb24sIEZ1bmdhbDwva2V5d29yZD48a2V5d29yZD5HZW5ldGlj

IEVuZ2luZWVyaW5nPC9rZXl3b3JkPjxrZXl3b3JkPk11bHRpZ2VuZSBGYW1pbHk8L2tleXdvcmQ+

PGtleXdvcmQ+UGxhc21pZHM8L2tleXdvcmQ+PGtleXdvcmQ+UHJvbW90ZXIgUmVnaW9ucywgR2Vu

ZXRpYzwva2V5d29yZD48a2V5d29yZD5TYWNjaGFyb215Y2VzIGNlcmV2aXNpYWU8L2tleXdvcmQ+

PGtleXdvcmQ+VHJhbnNjcmlwdGlvbmFsIEFjdGl2YXRpb248L2tleXdvcmQ+PC9rZXl3b3Jkcz48

dXJscz48cmVsYXRlZC11cmxzPjx1cmw+aHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9wdWJt

ZWQvMjU4NzE0MDU8L3VybD48L3JlbGF0ZWQtdXJscz48L3VybHM+PGlzYm4+MjE2MS01MDYzPC9p

c2JuPjx0aXRsZXM+PHRpdGxlPkEgSGlnaGx5IENoYXJhY3Rlcml6ZWQgWWVhc3QgVG9vbGtpdCBm

b3IgTW9kdWxhciwgTXVsdGlwYXJ0IEFzc2VtYmx5PC90aXRsZT48c2Vjb25kYXJ5LXRpdGxlPkFD

UyBTeW50aCBCaW9sPC9zZWNvbmRhcnktdGl0bGU+PC90aXRsZXM+PHBhZ2VzPjk3NS04NjwvcGFn

ZXM+PG51bWJlcj45PC9udW1iZXI+PGNvbnRyaWJ1dG9ycz48YXV0aG9ycz48YXV0aG9yPkxlZSwg

TS4gRS48L2F1dGhvcj48YXV0aG9yPkRlTG9hY2hlLCBXLiBDLjwvYXV0aG9yPjxhdXRob3I+Q2Vy

dmFudGVzLCBCLjwvYXV0aG9yPjxhdXRob3I+RHVlYmVyLCBKLiBFLjwvYXV0aG9yPjwvYXV0aG9y

cz48L2NvbnRyaWJ1dG9ycz48ZWRpdGlvbj4yMDE1LzA1LzAxPC9lZGl0aW9uPjxsYW5ndWFnZT5l

bmc8L2xhbmd1YWdlPjxhZGRlZC1kYXRlIGZvcm1hdD0idXRjIj4xNDkwMDk0ODY3PC9hZGRlZC1k

YXRlPjxyZWYtdHlwZSBuYW1lPSJKb3VybmFsIEFydGljbGUiPjE3PC9yZWYtdHlwZT48cmVjLW51

bWJlcj44PC9yZWMtbnVtYmVyPjxsYXN0LXVwZGF0ZWQtZGF0ZSBmb3JtYXQ9InV0YyI+MTQ5MDA5

NDg2NzwvbGFzdC11cGRhdGVkLWRhdGU+PGFjY2Vzc2lvbi1udW0+MjU4NzE0MDU8L2FjY2Vzc2lv

bi1udW0+PGVsZWN0cm9uaWMtcmVzb3VyY2UtbnVtPjEwLjEwMjEvc2I1MDAzNjZ2PC9lbGVjdHJv

bmljLXJlc291cmNlLW51bT48dm9sdW1lPjQ8L3ZvbHVtZT48L3JlY29yZD48L0NpdGU+PC9FbmRO

b3RlPn==

ADDIN EN.CITE PEVuZE5vdGU+PENpdGU+PEF1dGhvcj5FbmdsZXI8L0F1dGhvcj48WWVhcj4yMDE0PC9ZZWFyPjxJ

RFRleHQ+QSBnb2xkZW4gZ2F0ZSBtb2R1bGFyIGNsb25pbmcgdG9vbGJveCBmb3IgcGxhbnRzPC9J

RFRleHQ+PERpc3BsYXlUZXh0PjxzdHlsZSBmYWNlPSJpdGFsaWMgc3VwZXJzY3JpcHQiPjEsIDI8

L3N0eWxlPjwvRGlzcGxheVRleHQ+PHJlY29yZD48ZGF0ZXM+PHB1Yi1kYXRlcz48ZGF0ZT5Ob3Y8

L2RhdGU+PC9wdWItZGF0ZXM+PHllYXI+MjAxNDwveWVhcj48L2RhdGVzPjxrZXl3b3Jkcz48a2V5

d29yZD5BZ3JvYmFjdGVyaXVtIHR1bWVmYWNpZW5zPC9rZXl3b3JkPjxrZXl3b3JkPkNsb25pbmcs

IE1vbGVjdWxhcjwva2V5d29yZD48a2V5d29yZD5HZW5ldGljIEVuZ2luZWVyaW5nPC9rZXl3b3Jk

PjxrZXl3b3JkPkdlbmV0aWMgVmVjdG9yczwva2V5d29yZD48a2V5d29yZD5Nb2RlbHMsIEdlbmV0

aWM8L2tleXdvcmQ+PGtleXdvcmQ+U3ludGhldGljIEJpb2xvZ3k8L2tleXdvcmQ+PGtleXdvcmQ+

VG9iYWNjbzwva2V5d29yZD48L2tleXdvcmRzPjx1cmxzPjxyZWxhdGVkLXVybHM+PHVybD5odHRw

czovL3d3dy5uY2JpLm5sbS5uaWguZ292L3B1Ym1lZC8yNDkzMzEyNDwvdXJsPjwvcmVsYXRlZC11

cmxzPjwvdXJscz48aXNibj4yMTYxLTUwNjM8L2lzYm4+PHRpdGxlcz48dGl0bGU+QSBnb2xkZW4g

Z2F0ZSBtb2R1bGFyIGNsb25pbmcgdG9vbGJveCBmb3IgcGxhbnRzPC90aXRsZT48c2Vjb25kYXJ5

LXRpdGxlPkFDUyBTeW50aCBCaW9sPC9zZWNvbmRhcnktdGl0bGU+PC90aXRsZXM+PHBhZ2VzPjgz

OS00MzwvcGFnZXM+PG51bWJlcj4xMTwvbnVtYmVyPjxjb250cmlidXRvcnM+PGF1dGhvcnM+PGF1

dGhvcj5FbmdsZXIsIEMuPC9hdXRob3I+PGF1dGhvcj5Zb3VsZXMsIE0uPC9hdXRob3I+PGF1dGhv

cj5HcnVldHpuZXIsIFIuPC9hdXRob3I+PGF1dGhvcj5FaG5lcnQsIFQuIE0uPC9hdXRob3I+PGF1

dGhvcj5XZXJuZXIsIFMuPC9hdXRob3I+PGF1dGhvcj5Kb25lcywgSi4gRC48L2F1dGhvcj48YXV0

aG9yPlBhdHJvbiwgTi4gSi48L2F1dGhvcj48YXV0aG9yPk1hcmlsbG9ubmV0LCBTLjwvYXV0aG9y

PjwvYXV0aG9ycz48L2NvbnRyaWJ1dG9ycz48ZWRpdGlvbj4yMDE0LzAyLzIwPC9lZGl0aW9uPjxs

YW5ndWFnZT5lbmc8L2xhbmd1YWdlPjxhZGRlZC1kYXRlIGZvcm1hdD0idXRjIj4xNDkwMDk0OTEw

PC9hZGRlZC1kYXRlPjxyZWYtdHlwZSBuYW1lPSJKb3VybmFsIEFydGljbGUiPjE3PC9yZWYtdHlw

ZT48cmVjLW51bWJlcj45PC9yZWMtbnVtYmVyPjxsYXN0LXVwZGF0ZWQtZGF0ZSBmb3JtYXQ9InV0

YyI+MTQ5MDA5NDkxMDwvbGFzdC11cGRhdGVkLWRhdGU+PGFjY2Vzc2lvbi1udW0+MjQ5MzMxMjQ8

L2FjY2Vzc2lvbi1udW0+PGVsZWN0cm9uaWMtcmVzb3VyY2UtbnVtPjEwLjEwMjEvc2I0MDAxNTA0

PC9lbGVjdHJvbmljLXJlc291cmNlLW51bT48dm9sdW1lPjM8L3ZvbHVtZT48L3JlY29yZD48L0Np

dGU+PENpdGU+PEF1dGhvcj5MZWU8L0F1dGhvcj48WWVhcj4yMDE1PC9ZZWFyPjxJRFRleHQ+QSBI

aWdobHkgQ2hhcmFjdGVyaXplZCBZZWFzdCBUb29sa2l0IGZvciBNb2R1bGFyLCBNdWx0aXBhcnQg

QXNzZW1ibHk8L0lEVGV4dD48cmVjb3JkPjxkYXRlcz48cHViLWRhdGVzPjxkYXRlPlNlcDwvZGF0

ZT48L3B1Yi1kYXRlcz48eWVhcj4yMDE1PC95ZWFyPjwvZGF0ZXM+PGtleXdvcmRzPjxrZXl3b3Jk

PkROQSBDb3B5IE51bWJlciBWYXJpYXRpb25zPC9rZXl3b3JkPjxrZXl3b3JkPkROQSwgRnVuZ2Fs

PC9rZXl3b3JkPjxrZXl3b3JkPkVzY2hlcmljaGlhIGNvbGk8L2tleXdvcmQ+PGtleXdvcmQ+R2Vu

ZSBFeHByZXNzaW9uIFJlZ3VsYXRpb24sIEZ1bmdhbDwva2V5d29yZD48a2V5d29yZD5HZW5ldGlj

IEVuZ2luZWVyaW5nPC9rZXl3b3JkPjxrZXl3b3JkPk11bHRpZ2VuZSBGYW1pbHk8L2tleXdvcmQ+

PGtleXdvcmQ+UGxhc21pZHM8L2tleXdvcmQ+PGtleXdvcmQ+UHJvbW90ZXIgUmVnaW9ucywgR2Vu

ZXRpYzwva2V5d29yZD48a2V5d29yZD5TYWNjaGFyb215Y2VzIGNlcmV2aXNpYWU8L2tleXdvcmQ+

PGtleXdvcmQ+VHJhbnNjcmlwdGlvbmFsIEFjdGl2YXRpb248L2tleXdvcmQ+PC9rZXl3b3Jkcz48

dXJscz48cmVsYXRlZC11cmxzPjx1cmw+aHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9wdWJt

ZWQvMjU4NzE0MDU8L3VybD48L3JlbGF0ZWQtdXJscz48L3VybHM+PGlzYm4+MjE2MS01MDYzPC9p

c2JuPjx0aXRsZXM+PHRpdGxlPkEgSGlnaGx5IENoYXJhY3Rlcml6ZWQgWWVhc3QgVG9vbGtpdCBm

b3IgTW9kdWxhciwgTXVsdGlwYXJ0IEFzc2VtYmx5PC90aXRsZT48c2Vjb25kYXJ5LXRpdGxlPkFD

UyBTeW50aCBCaW9sPC9zZWNvbmRhcnktdGl0bGU+PC90aXRsZXM+PHBhZ2VzPjk3NS04NjwvcGFn

ZXM+PG51bWJlcj45PC9udW1iZXI+PGNvbnRyaWJ1dG9ycz48YXV0aG9ycz48YXV0aG9yPkxlZSwg

TS4gRS48L2F1dGhvcj48YXV0aG9yPkRlTG9hY2hlLCBXLiBDLjwvYXV0aG9yPjxhdXRob3I+Q2Vy

dmFudGVzLCBCLjwvYXV0aG9yPjxhdXRob3I+RHVlYmVyLCBKLiBFLjwvYXV0aG9yPjwvYXV0aG9y

cz48L2NvbnRyaWJ1dG9ycz48ZWRpdGlvbj4yMDE1LzA1LzAxPC9lZGl0aW9uPjxsYW5ndWFnZT5l

bmc8L2xhbmd1YWdlPjxhZGRlZC1kYXRlIGZvcm1hdD0idXRjIj4xNDkwMDk0ODY3PC9hZGRlZC1k

YXRlPjxyZWYtdHlwZSBuYW1lPSJKb3VybmFsIEFydGljbGUiPjE3PC9yZWYtdHlwZT48cmVjLW51

bWJlcj44PC9yZWMtbnVtYmVyPjxsYXN0LXVwZGF0ZWQtZGF0ZSBmb3JtYXQ9InV0YyI+MTQ5MDA5

NDg2NzwvbGFzdC11cGRhdGVkLWRhdGU+PGFjY2Vzc2lvbi1udW0+MjU4NzE0MDU8L2FjY2Vzc2lv

bi1udW0+PGVsZWN0cm9uaWMtcmVzb3VyY2UtbnVtPjEwLjEwMjEvc2I1MDAzNjZ2PC9lbGVjdHJv

bmljLXJlc291cmNlLW51bT48dm9sdW1lPjQ8L3ZvbHVtZT48L3JlY29yZD48L0NpdGU+PC9FbmRO

b3RlPn==

ADDIN EN.CITE.DATA 1, 2. Using Golden Gate, Gibson and similar assembly methods ADDIN EN.CITE <EndNote><Cite><Author>Casini</Author><Year>2015</Year><IDText>Bricks and blueprints: methods and standards for DNA assembly</IDText><DisplayText><style face="italic superscript">3</style></DisplayText><record><dates><pub-dates><date>09</date></pub-dates><year>2015</year></dates><keywords><keyword>Cloning, Molecular</keyword><keyword>Endonucleases</keyword><keyword>Genetic Engineering</keyword><keyword>Recombination, Genetic</keyword><keyword>Reference Standards</keyword><keyword>Synthetic Biology</keyword></keywords><urls><related-urls><url> and blueprints: methods and standards for DNA assembly</title><secondary-title>Nat Rev Mol Cell Biol</secondary-title></titles><pages>568-76</pages><number>9</number><contributors><authors><author>Casini, A.</author><author>Storch, M.</author><author>Baldwin, G. S.</author><author>Ellis, T.</author></authors></contributors><edition>2015/06/17</edition><language>eng</language><added-date format="utc">1490108177</added-date><ref-type name="Journal Article">17</ref-type><rec-number>11</rec-number><last-updated-date format="utc">1490108177</last-updated-date><accession-num>26081612</accession-num><electronic-resource-num>10.1038/nrm4014</electronic-resource-num><volume>16</volume></record></Cite></EndNote>3 these parts can be used to efficiently build variations of a construct allowing for rapid assembly, prototyping and optimisation of constructs, including complex gene clusters ADDIN EN.CITE <EndNote><Cite><Author>Smanski</Author><Year>2014</Year><IDText>Functional optimization of gene clusters by combinatorial design and assembly</IDText><DisplayText><style face="italic superscript">4</style></DisplayText><record><dates><pub-dates><date>Dec</date></pub-dates><year>2014</year></dates><keywords><keyword>High-Throughput Nucleotide Sequencing</keyword><keyword>Klebsiella oxytoca</keyword><keyword>Multigene Family</keyword><keyword>Nitrogen Fixation</keyword><keyword>Nitrogenase</keyword><keyword>Operon</keyword><keyword>Promoter Regions, Genetic</keyword><keyword>Transcription, Genetic</keyword></keywords><urls><related-urls><url> optimization of gene clusters by combinatorial design and assembly</title><secondary-title>Nat Biotechnol</secondary-title></titles><pages>1241-9</pages><number>12</number><contributors><authors><author>Smanski, M. J.</author><author>Bhatia, S.</author><author>Zhao, D.</author><author>Park, Y.</author><author>B A Woodruff, L.</author><author>Giannoukos, G.</author><author>Ciulla, D.</author><author>Busby, M.</author><author>Calderon, J.</author><author>Nicol, R.</author><author>Gordon, D. B.</author><author>Densmore, D.</author><author>Voigt, C. A.</author></authors></contributors><edition>2014/11/24</edition><language>eng</language><added-date format="utc">1490108424</added-date><ref-type name="Journal Article">17</ref-type><rec-number>12</rec-number><last-updated-date format="utc">1490108424</last-updated-date><accession-num>25419741</accession-num><electronic-resource-num>10.1038/nbt.3063</electronic-resource-num><volume>32</volume></record></Cite></EndNote>4. In addition, such parts can be distributed, either as a toolkit or individual parts and thus can be integrated easily into other projects. A significant amount of data is produced during these processes which can then be used to inform decisions in the design–build–test cycle, for instance through automated validation of bining the part-based assembly with the increasing use of robotic equipment unlocks the potential for a high-throughput construct creation, transfection into, and analytical measurement of, the recipient chassis organism. Automated laboratory equipment can quickly and repeatedly perform the tasks required to assemble a construct, often using smaller volumes, therefore making it both cost and time efficient ADDIN EN.CITE <EndNote><Cite><Author>Kanigowska</Author><Year>2016</Year><IDText>Smart DNA Fabrication Using Sound Waves: Applying Acoustic Dispensing Technologies to Synthetic Biology</IDText><DisplayText><style face="italic superscript">5</style></DisplayText><record><dates><pub-dates><date>Feb</date></pub-dates><year>2016</year></dates><keywords><keyword>Acoustics</keyword><keyword>Biomedical Technology</keyword><keyword>Molecular Biology</keyword><keyword>Solutions</keyword><keyword>Sound</keyword><keyword>Synthetic Biology</keyword></keywords><urls><related-urls><url> DNA Fabrication Using Sound Waves: Applying Acoustic Dispensing Technologies to Synthetic Biology</title><secondary-title>J Lab Autom</secondary-title></titles><pages>49-56</pages><number>1</number><contributors><authors><author>Kanigowska, P.</author><author>Shen, Y.</author><author>Zheng, Y.</author><author>Rosser, S.</author><author>Cai, Y.</author></authors></contributors><edition>2015/07/10</edition><language>eng</language><added-date format="utc">1490111405</added-date><ref-type name="Journal Article">17</ref-type><rec-number>13</rec-number><last-updated-date format="utc">1490111405</last-updated-date><accession-num>26163567</accession-num><electronic-resource-num>10.1177/2211068215593754</electronic-resource-num><volume>21</volume></record></Cite></EndNote>5.This increase in scale also leads to an increased level of complexity with respect to parts, processes and data, all of which need to be recorded and tracked within the laboratory. With a single construct it is relatively easy to keep track of what is required, something that swiftly becomes difficult as variations expand to encompass multiple combinations of components to formulate and optimise, for example, entire gene clusters (Fig. 1). A key issue is how to achieve the necessary management of this information; tracking the usage of the various parts and transitory elements through the process. The answer to this is the development and use of a Laboratory Information Management System (LIMS) to manage the production of the constructs and maximise rmation management systems already exist for various other disciplines, such as MISO ADDIN EN.CITE <EndNote><Cite><Author>Earlham Institute</Author><Year>2017</Year><IDText>MISO (Managing Information for Sequencing Operations)</IDText><DisplayText><style face="italic superscript">6</style></DisplayText><record><titles><title><style font="default" size="100%">MISO (Managing Information for Sequencing Operations</style><style face="italic" font="default" size="100%">)</style></title></titles><contributors><authors><author>Earlham Institute,</author></authors></contributors><added-date format="utc">1488547048</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>4</rec-number><last-updated-date format="utc">1492078262</last-updated-date><volume>2017</volume></record></Cite></EndNote>6 which has been designed for Next Generation Sequencing (NGS) workflows and PIMS ADDIN EN.CITE <EndNote><Cite><Author>Savitsky</Author><Year>2011</Year><IDText>Recording information on protein complexes in an information management system</IDText><DisplayText><style face="italic superscript">7</style></DisplayText><record><dates><pub-dates><date>Aug</date></pub-dates><year>2011</year></dates><keywords><keyword>Database Management Systems</keyword><keyword>Databases, Protein</keyword><keyword>Information Management</keyword><keyword>Multiprotein Complexes</keyword><keyword>Protein Conformation</keyword><keyword>User-Computer Interface</keyword><keyword>Workflow</keyword></keywords><urls><related-urls><url> information on protein complexes in an information management system</title><secondary-title>J Struct Biol</secondary-title></titles><pages>224-9</pages><number>2</number><contributors><authors><author>Savitsky, M.</author><author>Diprose, J. M.</author><author>Morris, C.</author><author>Griffiths, S. L.</author><author>Daniel, E.</author><author>Lin, B.</author><author>Daenke, S.</author><author>Bishop, B.</author><author>Siebold, C.</author><author>Wilson, K. S.</author><author>Blake, R.</author><author>Stuart, D. I.</author><author>Esnouf, R. M.</author></authors></contributors><edition>2011/05/14</edition><language>eng</language><added-date format="utc">1492071326</added-date><ref-type name="Journal Article">17</ref-type><rec-number>14</rec-number><last-updated-date format="utc">1492071326</last-updated-date><accession-num>21605682</accession-num><electronic-resource-num>10.1016/j.jsb.2011.05.009</electronic-resource-num><volume>175</volume></record></Cite></EndNote>7 for managing protein production. While it should be possible to adapt an existing LIMS to handle this complexity, it is more effective to design a system from the ground up that can be used as a single package rather than following instructions to replicate an existing configuration (e.g. installation of plugins) in another piece of software. We therefore took this approach and created a LIMS system that is freely available for the community to use.Figure 1: A simplified and condensed version of a synthetic biology workflow for optimising protein expression. In part A multiple constructs are assembled, each a variation centred on expressing a specific coding sequence (CDS) using differing promoter and ribosome binding site (RBS), C-terminal epitope tag and terminator parts. After transformation, resulting colonies are checked by colony PCR. At the culture phase, three incorrect constructs are discarded and the remainder are used for DNA minipreps. In part B the construct DNAs are transformed into a cell line suitable for protein expression. Expression of the protein is then analysed under a variety of conditions, in this case using a Biolector microbioreactor (m2p-labs). A significant number of conditions could be tested for each construct and may include culture medium, temperature, bacterial chassis and oxygen. The most highly expressing construct is then picked for large scale expression and purification in C. If in part B no construct had yielded the required output the process would then go back to A with new designs based on the best performing variants. This is representative of the design - build - test cycle that forms an integral part of synthetic biology ADDIN EN.CITE <EndNote><Cite><Author>Johnson</Author><Year>2016</Year><IDText>GeneMill: A 21st century platform for innovation</IDText><DisplayText><style face="italic superscript">8</style></DisplayText><record><dates><pub-dates><date>Jun</date></pub-dates><year>2016</year></dates><urls><related-urls><url>: A 21st century platform for innovation</title><secondary-title>Biochem Soc Trans</secondary-title></titles><pages>681-3</pages><number>3</number><contributors><authors><author>Johnson, J. R.</author><author>D&apos;Amore, R.</author><author>Thain, S. C.</author><author>Craig, T.</author><author>McCue, H. V.</author><author>Hertz-Fowler, C.</author><author>Hall, N.</author><author>Hall, A. J.</author></authors></contributors><language>eng</language><added-date format="utc">1492072914</added-date><ref-type name="Journal Article">17</ref-type><rec-number>19</rec-number><last-updated-date format="utc">1492072914</last-updated-date><accession-num>27284025</accession-num><electronic-resource-num>10.1042/BST20160012</electronic-resource-num><volume>44</volume></record></Cite></EndNote>8. In part C the optimal construct and conditions identified in part B are taken and expressed to generate larger quantities of the required protein. The symbols used correspond to standard SBOL visual with the addition of “c” to a CDS to indicate a C-terminal epitope tag.Results and DiscussionIdentifying the need for a new LIMSBy looking at the various processes taking place in a synthetic biology laboratory setting, the following specifications were drawn up for the laboratory management system: Firstly, multiple forms of data need to be stored (e.g. project metadata, experimental designs) and any material received, tracked (e.g. DNA Parts, design data, primers). Secondly, outputs produced during each process (e.g. colonies, PCR products) and the data produced by the workflows themselves require tracking as these might be utilised in later steps. Thirdly, the LIMS needed to record information on the above, for instance validation such as sequencing or data describing functionality. Lastly, the system needed to be flexible and scalable to handle both manual and automated data entry. The system also needed to be flexibly designed such that new information, components and methods within the rapidly changing field of synthetic biology can be accommodated.With the requirements set, existing systems were evaluated to assess if they offered the necessary features. Existing commercial LIMS such as GenoLogics Clarity ADDIN EN.CITE <EndNote><Cite><Author>GenoLogics</Author><Year>2017</Year><IDText>BaseSpace Clarity LIMS</IDText><DisplayText><style face="italic superscript">9</style></DisplayText><record><urls><related-urls><url> Clarity LIMS</title></titles><number>2017</number><contributors><authors><author>GenoLogics</author></authors></contributors><added-date format="utc">1492071515</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>15</rec-number><last-updated-date format="utc">1492075869</last-updated-date></record></Cite></EndNote>9 are geared towards the NGS market and lacked the flexibility required to support synthetic biology workflows. This was also the case with the open source MISO system ADDIN EN.CITE <EndNote><Cite><Author>Earlham Institute</Author><Year>2017</Year><IDText>MISO (Managing Information for Sequencing Operations)</IDText><DisplayText><style face="italic superscript">6</style></DisplayText><record><titles><title><style font="default" size="100%">MISO (Managing Information for Sequencing Operations</style><style face="italic" font="default" size="100%">)</style></title></titles><contributors><authors><author>Earlham Institute,</author></authors></contributors><added-date format="utc">1488547048</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>4</rec-number><last-updated-date format="utc">1492078262</last-updated-date><volume>2017</volume></record></Cite></EndNote>6, designed to efficiently handle NGS projects and hence would require a complete re-write. Various other open source projects such as openBIS ADDIN EN.CITE <EndNote><Cite><Author>Barillari</Author><Year>2016</Year><IDText>openBIS ELN-LIMS: an open-source database for academic laboratories</IDText><DisplayText><style face="italic superscript">10</style></DisplayText><record><dates><pub-dates><date>Feb</date></pub-dates><year>2016</year></dates><keywords><keyword>Biological Science Disciplines</keyword><keyword>Computer Systems</keyword><keyword>Databases, Factual</keyword><keyword>Information Management</keyword><keyword>Information Systems</keyword><keyword>Laboratories</keyword><keyword>Software</keyword><keyword>User-Computer Interface</keyword></keywords><urls><related-urls><url> ELN-LIMS: an open-source database for academic laboratories</title><secondary-title>Bioinformatics</secondary-title></titles><pages>638-40</pages><number>4</number><contributors><authors><author>Barillari, C.</author><author>Ottoz, D. S.</author><author>Fuentes-Serna, J. M.</author><author>Ramakrishnan, C.</author><author>Rinn, B.</author><author>Rudolf, F.</author></authors></contributors><edition>2015/10/27</edition><language>eng</language><added-date format="utc">1492073494</added-date><ref-type name="Journal Article">17</ref-type><rec-number>20</rec-number><last-updated-date format="utc">1492073494</last-updated-date><accession-num>26508761</accession-num><electronic-resource-num>10.1093/bioinformatics/btv606</electronic-resource-num><volume>32</volume></record></Cite></EndNote>10, PIMS and BikaLIMS ADDIN EN.CITE <EndNote><Cite><Author>BikaLabs</Author><Year>2017</Year><IDText>BikaLIMS</IDText><DisplayText><style face="italic superscript">11</style></DisplayText><record><urls><related-urls><url> format="utc">1492073799</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>21</rec-number><last-updated-date format="utc">1492073835</last-updated-date><volume>2017</volume></record></Cite></EndNote>11 provide some of the feature set such as sample and project tracking but were again designed to suit either a more clinical setting or towards a specific purpose (e.g. protein structure analysis). JIRA ADDIN EN.CITE <EndNote><Cite><Author>Atlassian</Author><Year>2017</Year><IDText>JIRA Software - Issue and Project Tracking for Software Teams</IDText><DisplayText><style face="italic superscript">12</style></DisplayText><record><urls><related-urls><url> Software - Issue and Project Tracking for Software Teams</title></titles><contributors><authors><author>Atlassian</author></authors></contributors><added-date format="utc">1492073893</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>22</rec-number><last-updated-date format="utc">1492073980</last-updated-date></record></Cite></EndNote>12, an issue management tool from Atlassian, was also evaluated. While more flexible than most of the identified tools, it required too many workarounds to be used as a viable long-term solution. There are other technologies that currently show strong promise, such as Antha ADDIN EN.CITE <EndNote><Cite><Author>Synthace</Author><Year>2017</Year><IDText>About Antha</IDText><DisplayText><style face="italic superscript">13</style></DisplayText><record><urls><related-urls><url> Antha</title></titles><contributors><authors><author>Synthace</author></authors></contributors><added-date format="utc">1488453727</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>1</rec-number><publisher>Synthace</publisher><last-updated-date format="utc">1488469475</last-updated-date></record></Cite></EndNote>13 from Synthace. This provides an open source biology focused language for developing protocols but it requires the use of the closed-source “Antha OS” to provide a user interface and enable its use for a LIMS like system. The closed-source nature of this project therefore limits input from relevant entities such as academic laboratories. For synthetic biology, some resources have been developed such as ICE ADDIN EN.CITE <EndNote><Cite><Author>Ham</Author><Year>2012</Year><IDText>Design, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools</IDText><DisplayText><style face="italic superscript">14</style></DisplayText><record><dates><pub-dates><date>Oct</date></pub-dates><year>2012</year></dates><keywords><keyword>Arabidopsis</keyword><keyword>Internet</keyword><keyword>Plasmids</keyword><keyword>Registries</keyword><keyword>Seeds</keyword><keyword>Sequence Analysis, DNA</keyword><keyword>Software</keyword><keyword>Synthetic Biology</keyword><keyword>User-Computer Interface</keyword></keywords><urls><related-urls><url>, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools</title><secondary-title>Nucleic Acids Res</secondary-title></titles><pages>e141</pages><number>18</number><contributors><authors><author>Ham, T. S.</author><author>Dmytriv, Z.</author><author>Plahar, H.</author><author>Chen, J.</author><author>Hillson, N. J.</author><author>Keasling, J. D.</author></authors></contributors><edition>2012/06/19</edition><language>eng</language><added-date format="utc">1495616801</added-date><ref-type name="Journal Article">17</ref-type><rec-number>23</rec-number><last-updated-date format="utc">1495616801</last-updated-date><accession-num>22718978</accession-num><electronic-resource-num>10.1093/nar/gks531</electronic-resource-num><volume>40</volume></record></Cite></EndNote>14 for storing biological sequences and BioPartsDB ADDIN EN.CITE <EndNote><Cite><Author>Stracquadanio</Author><Year>2016</Year><IDText>BioPartsDB: a synthetic biology workflow web-application for education and research</IDText><DisplayText><style face="italic superscript">15</style></DisplayText><record><dates><pub-dates><date>Nov</date></pub-dates><year>2016</year></dates><urls><related-urls><url>: a synthetic biology workflow web-application for education and research</title><secondary-title>Bioinformatics</secondary-title></titles><pages>3519-3521</pages><number>22</number><contributors><authors><author>Stracquadanio, G.</author><author>Yang, K.</author><author>Boeke, J. D.</author><author>Bader, J. S.</author></authors></contributors><edition>2016/07/13</edition><language>eng</language><added-date format="utc">1495618018</added-date><ref-type name="Journal Article">17</ref-type><rec-number>24</rec-number><last-updated-date format="utc">1495618018</last-updated-date><accession-num>27412090</accession-num><electronic-resource-num>10.1093/bioinformatics/btw394</electronic-resource-num><volume>32</volume></record></Cite></EndNote>15 for workflow processes. These provide the individual components but lack the integration and depth required for project management. We therefore decided to develop an open source system that would fulfil the criteria demanded for synthetic biology.Understanding how Leaf LIMS operatesLeaf LIMS was designed first and foremost to handle information generated through laboratory operations. It takes a task-based approach, breaking down protocols and processes in the laboratory to their minimal components (e.g. steps such as an incubation or PCR conditions) and then arranging these to build workflow templates, much like a protocol. To run a workflow requires that a product, the final output of one or more workflows, needs to be defined: for instance the DNA parts required and their arrangement or the type of data required (e.g. screening data). These products are then grouped together into a project, a representation of related processes. A demonstration of this can be seen in Figure 2, where a project has been defined to optimise expression of protein X. There are eleven products, nine that represent constructs in E. coli, one set of screening data for all nine constructs, while the final product is the purified protein derived from the optimised construct. A run is started using the “Build DNA Construct” workflow template which provides the process designs for the constructs, the first nine products of this project. Each task is run until all nine constructs are produced based on the given designs. It is then possible to use these products, the nine constructs, in a different workflow template. In this example it is a protein screen to produce the output for the “screening data” product. Figure 2: An example of how Leaf LIMS operates. The project, in this case to optimise the expression of protein X, has eleven products. Nine products (labelled 1 to 9) are DNA constructs; once the nine constructs are created they will then be assayed to produce the data for the screening data product (labelled 10). The optimised protein product (labelled 11) will apply the results from the screening data to select the construct which produces the most protein for optimisation. For each product to be created, a run (Construct Assembly for Gene X) will take the selected products (in this case 1 to 9) and use a workflow template (Build DNA Construct) to complete each of the tasks by which it is defined. The initial information on what is required to make each product is specified using a design in one of the supported files types (e.g. csv or SBOL v2). The design files contain details of each part used (concentration, plate location and/or barcode, etc.), their order in the final construct, its unique ID and other parameters as chosen by the user, e.g. the DNA sequence. This results in the creation of nine constructs at the end of the workflow, which can now be used in the screening step.Managing projects and their outputsAny significant undertaking in a laboratory requires access to various sources of information, e.g. protocols and records of the progress of various constituent parts through the laboratory environment. Leaf LIMS organises logically related work into a project (e.g. Expression optimisation of protein X in Figure 2). It contains data on what is to be produced and for whom, along with any relevant information on how this is to be accomplished. A project can link to external resources (e.g. protocols and laboratory notebooks) and integrates with Salesforce CRM ADDIN EN.CITE <EndNote><Cite><Author></Author><Year>2017</Year><IDText>What is Salesforce?</IDText><DisplayText><style face="italic superscript">16</style></DisplayText><record><urls><related-urls><url> is Salesforce?</title></titles><contributors><authors><author></author></authors></contributors><added-date format="utc">1492071861</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>16</rec-number><last-updated-date format="utc">1492071944</last-updated-date></record></Cite></EndNote>16 for projects that require management of financial/customer information and communications. To provide flexibility to the system, Leaf LIMS uses products to indicate the outputs of the project. By organising the project in this way it is possible to track every component that makes up the product in a single location in the user interface, thus ensuring that items are accounted for through the entire process. These items are tracked through an integrated inventory system and include both the items required to produce the final product (e.g. DNA parts) as well as items generated during the creation process (e.g. constructs) (Fig. 3). Storing items in this manner allows for them to be shared between projects, a key requirement when using a parts-based toolkit ADDIN EN.CITE <EndNote><Cite><Author>Engler</Author><Year>2014</Year><IDText>A golden gate modular cloning toolbox for plants</IDText><DisplayText><style face="italic superscript">1</style></DisplayText><record><dates><pub-dates><date>Nov</date></pub-dates><year>2014</year></dates><keywords><keyword>Agrobacterium tumefaciens</keyword><keyword>Cloning, Molecular</keyword><keyword>Genetic Engineering</keyword><keyword>Genetic Vectors</keyword><keyword>Models, Genetic</keyword><keyword>Synthetic Biology</keyword><keyword>Tobacco</keyword></keywords><urls><related-urls><url> golden gate modular cloning toolbox for plants</title><secondary-title>ACS Synth Biol</secondary-title></titles><pages>839-43</pages><number>11</number><contributors><authors><author>Engler, C.</author><author>Youles, M.</author><author>Gruetzner, R.</author><author>Ehnert, T. M.</author><author>Werner, S.</author><author>Jones, J. D.</author><author>Patron, N. J.</author><author>Marillonnet, S.</author></authors></contributors><edition>2014/02/20</edition><language>eng</language><added-date format="utc">1490094910</added-date><ref-type name="Journal Article">17</ref-type><rec-number>9</rec-number><last-updated-date format="utc">1490094910</last-updated-date><accession-num>24933124</accession-num><electronic-resource-num>10.1021/sb4001504</electronic-resource-num><volume>3</volume></record></Cite></EndNote>1. A product does not have to be a physical item and can represent everything from a construct to protein expression data. It is possible to have an entirely data based workflow that does not produce a physical item at any point. This flexibility means that products can be tracked through the entire design - build - test cycle and ensures data is captured at every point. We also designed the facility to handle large numbers of products within each project, which is especially important with the increase in use of automation and the ease with which large numbers of differing constructs can be created.Leaf LIMS was designed to integrate with third party tools to allow flexibility in how constructs are designed as well as allowing it to fit into existing design workflows. To this end, support is in place for multiple file formats including Synthetic Biology Open Language (SBOL v2) ADDIN EN.CITE <EndNote><Cite><Author>Beal</Author><Year>2016</Year><IDText>Synthetic Biology Open Language (SBOL) Version 2.1.0</IDText><DisplayText><style face="italic superscript">17</style></DisplayText><record><dates><pub-dates><date>Dec</date></pub-dates><year>2016</year></dates><keywords><keyword>Programming Languages</keyword><keyword>Synthetic Biology</keyword></keywords><urls><related-urls><url> Biology Open Language (SBOL) Version 2.1.0</title><secondary-title>J Integr Bioinform</secondary-title></titles><pages>291</pages><number>3</number><contributors><authors><author>Beal, J.</author><author>Cox, R. S.</author><author>Grünberg, R.</author><author>McLaughlin, J.</author><author>Nguyen, T.</author><author>Bartley, B.</author><author>Bissell, M.</author><author>Choi, K.</author><author>Clancy, K.</author><author>Macklin, C.</author><author>Madsen, C.</author><author>Misirli, G.</author><author>Oberortner, E.</author><author>Pocock, M.</author><author>Roehner, N.</author><author>Samineni, M.</author><author>Zhang, M.</author><author>Zhang, Z.</author><author>Zundel, Z.</author><author>Gennari, J.</author><author>Myers, C.</author><author>Sauro, H.</author><author>Wipat, A.</author></authors></contributors><edition>2016/12/18</edition><language>eng</language><added-date format="utc">1496138763</added-date><ref-type name="Journal Article">17</ref-type><rec-number>25</rec-number><last-updated-date format="utc">1496138763</last-updated-date><accession-num>28187407</accession-num><electronic-resource-num>10.2390/biecoll-jib-2016-291</electronic-resource-num><volume>13</volume></record></Cite></EndNote>17, GenBank ADDIN EN.CITE <EndNote><Cite><Author>Benson</Author><Year>2013</Year><IDText>GenBank</IDText><DisplayText><style face="italic superscript">18</style></DisplayText><record><dates><pub-dates><date>Jan</date></pub-dates><year>2013</year></dates><keywords><keyword>Base Sequence</keyword><keyword>Databases, Nucleic Acid</keyword><keyword>Genomics</keyword><keyword>High-Throughput Nucleotide Sequencing</keyword><keyword>Internet</keyword><keyword>Molecular Sequence Annotation</keyword><keyword>Sequence Analysis, DNA</keyword></keywords><urls><related-urls><url> Acids Res</secondary-title></titles><pages>D36-42</pages><number>Database issue</number><contributors><authors><author>Benson, D. A.</author><author>Cavanaugh, M.</author><author>Clark, K.</author><author>Karsch-Mizrachi, I.</author><author>Lipman, D. J.</author><author>Ostell, J.</author><author>Sayers, E. W.</author></authors></contributors><edition>2012/11/27</edition><language>eng</language><added-date format="utc">1496139433</added-date><ref-type name="Journal Article">17</ref-type><rec-number>26</rec-number><last-updated-date format="utc">1496139433</last-updated-date><accession-num>23193287</accession-num><electronic-resource-num>10.1093/nar/gks1195</electronic-resource-num><volume>41</volume></record></Cite></EndNote>18 and CSV. The imported files are parsed to extract the relevant information required to build a product, for example a construct. These designs can be re-parsed at any point to extract further information. From these data, links are created to items in the inventory system, providing information required for use in tasks and for display in an SBOL visual formatted diagram. Linking back to the inventory this way allows for combinatorial designs to utilise the same parts in different contexts. Tasks can produce outputs (e.g. a transcriptional unit) which are then included in the inventory and linked to the respective product. A design file can also be uploaded that contains components not available at the time. Subsequently, when available the system will identify them and automatically include them. For example a design could reference multiple individual transcriptional units, each of which have been assembled as separate products with their own design. An entire project can therefore be planned in advance with all designs finalised before any laboratory work begins.Figure 3: Linking products to the inventory. The three products shown contain designs to make different constructs expressing gene X (labelled GX), each with a different RBS. Yellow lines indicate a part used in all constructs, purple lines indicate a part used only once. All the relevant DNA parts to make these constructs are entered into the integrated inventory system as individual items. On the left are the required inputs (in this case the DNA parts) that are then linked to the products that require them. These can then be used in a task, where the respective workflow picks what is needed to make the construct. Once a construct has been produced it is automatically added to the inventory as an item of the type “DNA Construct” and linked to the product that was used to create it. All the information on the created product is therefore stored in one place. Managing laboratory processes through workflowsThe intention of synthetic biology is to bring an engineering-focused approach to biology. To this end, Leaf LIMS uses task-based workflows to provide a modular system for building a consistent process (Fig 4). Each task provides a set of fields that are used to record information on inputs, values and outputs. These data are linked to the products produced by the task and cannot be modified afterwards, even if the task or workflow is changed. This mode of function allows the LIMS to operate with both automated and manual data entry, thus being adaptable to differences in how laboratories operate. These tasks and workflows can ultimately be shared between laboratories, thus allowing for easy adoption of new or improved processes without requiring specific equipment.Figure 4: Starting and completing a task during a run. Three constructs, the products of an earlier workflow have been linked to the workflow for the screening data product. The constructs are automatically picked up as the run has been set up to use these products. Three items are selected from the inventory: transformation plates, SOC media and BL21 cells. These items are required to complete the task and the inventory will be automatically updated to reflect the amounts used. At this point it is also possible to make alterations to the amounts to be utilised before starting (for instance if you wanted to halve the volume of cells used). Once data has been entered and items selected, one or more files can be exported to a CSV file and then imported by the equipment to enable set up. Once the task has been completed it will generate three outputs (reflected by having three items passed to the task) of the “transformation plate” type. The plates will at this point physically contain the transformation mix ready for use in the laboratory. These are added to the inventory and reflect physical items created during the transformation step. To make it easy to find these they are also then linked to the product that created them so they can be used in the next step. Any information entered into the task is also stored (e.g. input types and amounts) and results are then copied over and linked for later use.Tasks are intended to represent the smallest possible unit of a process; these can then be combined in different orders to create workflows that can run a given process. A task consists of a number of fields that define its structure; input fields record the components of a task (e.g. a buffer) while output fields produce an item in the inventory system corresponding to the output generated by the process (e.g. colonies on a plate). There are also field types designed to simplify the setup of a task for the user. Calculations can be defined to compute values that can be used in other fields (for example to derive the amount needed for an input field) and in any files to be generated by the task. These calculations can refer to other fields by their names and in conjunction with variable fields can be used to easily modify common parameters for a process (e.g. the annealing temperature in a PCR). Not all are fully automated and equipment often requires at least some level of human intervention. Step fields are provided for this purpose, allowing for written descriptions with the capacity to define up to four parameters (e.g. for defining times or temperatures in the step). All the fields in a task are assigned a default value that can be overridden either for all items or for individual items by supplying a file containing the necessary information. Using a combination of these fields, an entire protocol can be constructed in such a way that a user, without prior knowledge of the protocol, can accurately run the task with the pertinent values.A workflow acts as a template for the order in which tasks are to be executed. However, experiments may need to be modified and additional tasks may be required to keep pace with developments. Leaf LIMS uses the concept of a "run" to solve this issue, taking the order of tasks defined in a workflow yet allowing for new tasks to be dropped in at latter points to effect changes to an established workflow. The next task will not become available until the current task has been completed successfully. If there are failures at the completion of the current task a new run will be generated, initiating either at the current or an earlier task. Successes will result in the next task of the workflow being initiated until the workflow is complete. These decisions are currently made manually based on the results obtained but future work will incorporate automated decision making, allowing tasks to be initiated based on criteria such as responses from equipment and decision trees. Data produced from equipment (e.g. sequencing results for QC) available in a network accessible location can be copied across and linked to the relevant products.Tasks and workflows are fully configurable in Leaf LIMS with no limit to the number of tasks or workflows that can be created. They are designed to be re-used or easily modified to create variations around commonly used procedures. Existing tasks can be readily adapted to new situations without impacting on any of the data that has already been gathered as this is permanently associated with the product.Sample and consumable trackingOne of the issues when working in a busy, multi-person laboratory is that it can be difficult to know the availability of consumables at any one point. Leaf LIMS addresses this by providing an inventory system to track items including DNA parts and reagents. This system is flexible enough to record amounts of both labware (e.g. boxes of plates) and contents of tubes (e.g. enzymes, plasmids, primers). Each has an associated amount, for instance an SI unit such as microlitres. Using SI units allows for partial amounts to be used (e.g. 50?l out of 5ml) and remain accurate. Custom units can also be defined although these are limited to being defined as an integer. To handle other information that is not one of the core inventory fields, such as manufacturer, item properties can be defined. Each property on an item represents a user defined key with a corresponding written value and no limit to the number of properties. Any inventory item created by the system automatically provides a link to the items that created it, allowing for the iterative building of new items (e.g. parts) yet still maintaining the connection to its sources. The inventory is also coupled into an alert framework which can be used to set up conditions to monitor and provide notifications when they are reached. With this it is possible to set up thresholds at which the relevant users are notified so that the item is restocked in a timely way. The inventory system is designed so that it can work in parallel with existing dedicated inventory systems such as Mosaic ADDIN EN.CITE <EndNote><Cite><Author>Titian</Author><Year>2017</Year><IDText>Mosaic</IDText><DisplayText><style face="italic superscript">19</style></DisplayText><record><urls><related-urls><url> Software</author></authors></contributors><added-date format="utc">1492072158</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>17</rec-number><last-updated-date format="utc">1492072200</last-updated-date></record></Cite></EndNote>19 and Open Freezer ADDIN EN.CITE <EndNote><Cite><Author>Olhovsky</Author><Year>2011</Year><IDText>OpenFreezer: a reagent information management software system</IDText><DisplayText><style face="italic superscript">20</style></DisplayText><record><dates><pub-dates><date>Jul</date></pub-dates><year>2011</year></dates><keywords><keyword>Databases, Factual</keyword><keyword>Documentation</keyword><keyword>Indicators and Reagents</keyword><keyword>Information Storage and Retrieval</keyword><keyword>Internet</keyword><keyword>Software</keyword><keyword>User-Computer Interface</keyword></keywords><urls><related-urls><url>: a reagent information management software system</title><secondary-title>Nat Methods</secondary-title></titles><pages>612-3</pages><number>8</number><contributors><authors><author>Olhovsky, M.</author><author>Williton, K.</author><author>Dai, A. Y.</author><author>Pasculescu, A.</author><author>Lee, J. P.</author><author>Goudreault, M.</author><author>Wells, C. D.</author><author>Park, J. G.</author><author>Gingras, A. C.</author><author>Linding, R.</author><author>Pawson, T.</author><author>Colwill, K.</author></authors></contributors><edition>2011/07/28</edition><language>eng</language><added-date format="utc">1492072353</added-date><ref-type name="Journal Article">17</ref-type><rec-number>18</rec-number><last-updated-date format="utc">1492072353</last-updated-date><accession-num>21799493</accession-num><electronic-resource-num>10.1038/nmeth.1658</electronic-resource-num><volume>8</volume></record></Cite></EndNote>20 but as yet there is no integration beyond allowing import of items already in the previously mentioned systems.Controlling accessEach project, product, workflow and inventory item is assigned a set of permissions that indicate who and at what level groups of users can access it. A user can be assigned to multiple groups, each of which can grant either read or write access to a project. This allows a simple but granular access to items in the system thereby allowing individuals to work within the same system without revealing access controlled information.Using equipment to automate processesWhile automation is becoming more prevalent in the laboratory setting there are still limitations and almost all laboratories still require the handling of various steps by laboratory personnel. Leaf LIMS aims to strike a balance between providing a human-focused approach yet still offering the capability to simplify interactions with automated equipment. At its most simple, Leaf LIMS allows for tasks to generate files that can be used as inputs for equipment by using information defined in the task (e.g. volumes, component location). Files for use as inputs to equipment are easily configurable, allowing integration with any equipment that supports text files. Eventually this will be expanded and will allow for true communications between laboratory equipment and tasks, driving the system towards minimal manual intervention. We aim to take advantage of a plugin-based system for integration with equipment, for example producing optimised task lists and directly interfacing with equipment.Leaf LIMS also provides a machine-agnostic approach to handling the files generated by equipment. The files that are required to be copied over can be defined directly or through a template system that allows variables (such as the date) or regular expressions to be used to create the name of the file. They are then copied and linked to the relevant products when a task using that equipment is run. These can then be queried later to extract further information.Expanding the scope beyond synthetic biologyLeaf LIMS is designed primarily with synthetic biology in mind but this does not restrict it from being used for other purposes. While synthetic biology requires the handling of a variety of complex tasks other areas are as demanding in their requirements and would benefit from its flexibility and adaptability. The highly configurable nature of the system can be adapted for use in any circumstance where repeated, consistent workflows are required or where items are produced and need to be tracked; examples of this could include use in generalised laboratory settings where protocols are converted to tasks and workflows or as a tracking system for managing academic projects and their outputs and even for use in purely data driven settings such as bioinformatic analyses where projects and data need to be managed.Future directionThe future roadmap covers a wide range of features that will enhance the current system’s utility. In the immediate future these include the creation of plugins to allow integration of available tools and extensions to the system (e.g. integration with parts repositories such as ICE ADDIN EN.CITE <EndNote><Cite><Author>Ham</Author><Year>2012</Year><IDText>Design, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools</IDText><DisplayText><style face="italic superscript">14</style></DisplayText><record><dates><pub-dates><date>Oct</date></pub-dates><year>2012</year></dates><keywords><keyword>Arabidopsis</keyword><keyword>Internet</keyword><keyword>Plasmids</keyword><keyword>Registries</keyword><keyword>Seeds</keyword><keyword>Sequence Analysis, DNA</keyword><keyword>Software</keyword><keyword>Synthetic Biology</keyword><keyword>User-Computer Interface</keyword></keywords><urls><related-urls><url>, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools</title><secondary-title>Nucleic Acids Res</secondary-title></titles><pages>e141</pages><number>18</number><contributors><authors><author>Ham, T. S.</author><author>Dmytriv, Z.</author><author>Plahar, H.</author><author>Chen, J.</author><author>Hillson, N. J.</author><author>Keasling, J. D.</author></authors></contributors><edition>2012/06/19</edition><language>eng</language><added-date format="utc">1495616801</added-date><ref-type name="Journal Article">17</ref-type><rec-number>23</rec-number><last-updated-date format="utc">1495616801</last-updated-date><accession-num>22718978</accession-num><electronic-resource-num>10.1093/nar/gks531</electronic-resource-num><volume>40</volume></record></Cite></EndNote>14). Further work would include the integration of more third-party tools, such as the design of constructs, equipment control software to allow greater levels of automation and machine integration and a greater range of inventory management tools to boost the capabilities of the integrated system. There are also plans to increase the planning capabilities of the LIMS to allow for the estimation of costs and time involved in a given workflow.Leaf LIMS is currently being used in three laboratories (GeneMill, EGF and Earlham), with their feedback used to prioritise the roadmap for development. This level of co-development ensures that Leaf LIMS is constantly on the forefront of best practices for synthetic biology and receives features that benefit a broad range of users rather than just a single laboratory. The benefits of Leaf LIMS are already being seen in GeneMill where it has reduced the amount of time taken to plan and document laboratory work. For example, when running multiple projects simultaneously, it is desirable to run as many assemblies or PCR reactions in one plate as possible. However, it can be very time-consuming to plan this manually in order to maximise the productivity of laboratory staff and/or automation. The introduction of Leaf LIMS has greatly streamlined these processes, decreasing time spent per project, improving turnaround time and reducing the cost to the user.ConclusionLeaf LIMS provides a system that is tailor-made for handling the complexity of synthetic biology workflows. It can track the myriad components required for a specific product and handles the various required outputs from the processes that create it through an inventory system. It is centred on a task-based system that is easy to use and configure, allowing efficient development and utilisation of common workflows, storing the relevant data and finally ensures that project and product information is easily accessible. The code has been released as open source to ensure that the community has access and can collaborate ensuring the continued development of Leaf LIMS and the ability to adapt to new situations and technologies as they evolve.MethodsLeaf LIMS is composed of two components that communicate to provide a working system: a back-end API (Application Programming Interface) written in Python and a front-end written in HTML utilising the AngularJS framework. These components can be deployed separately or together under one package using Docker (Fig. 5). Figure 5: How Leaf LIMS is packaged for use. The system is usually deployed on Docker (grey box) but functions the same if each component is installed manually. The backend can be run separately if a user interface is not required or a custom user interface (UI) is required. All requests are passed to the Nginx server which either displays the UI or passes the request directly to the API on the backend. Requests to the API query the database for information or create a background task if the request requires significant run time. When distributed through Docker, two packages are created: front end (green box) and back end (blue boxes) but are usually installed through the docker-compose tool which installs these along with all the other requirements (the database and background task handling).The back end provides a REST (Representational state transfer) API written in Python 3 (specifically 3.5) and utilises the Django and Django REST Frameworks to create an easily extensible and highly capable code base. Data is stored in a Postgres 9 database and takes advantage of some of the newer features regarding the storing and querying of JSON data. The system design is modular in nature which makes it simple to add new features and thus make regular improvements to the versatility of Leaf LIMS. Various open source technologies are used through the system to prevent re-invention such as Redis for running background tasks. The API is designed to be able to communicate with third party solutions therefore this provides token based authentication, meaning these third parties are not required to store the username or password.The front end of Leaf LIMS was designed to be system agnostic allowing it to be run on any system that offers a modern, standards-compliant web browser. To this end it was written in HTML and utilises the AngularJS and Angular Material libraries to support complex interactions available in native applications on Windows or Mac OS. Taking advantage of HTML5 technology it can function on a PC or mobile with ease, adjusting itself to suit screen restrictions.Figure 6: The project details page at different screen resolutions. Leaf LIMS adapts the page to suit the screen resolution, repositioning elements so they can still be used. There is no degradation in functionality at mobile sized resolutions and all tasks can still be performed.Any LIMS is complex and requires constant development to ensure it maintains parity with advances in synthetic biology. To ensure that during this development process the system maintains its correct functionality, Leaf LIMS provides a comprehensive test suite. Using this, any changes can be audited, allowing any changes in operation to be identified and fixed before deployment.Leaf LIMS is open source and available under a permissive licence. The code, including documentation, is available on GitHub under the Leaf LIMS project (). All contributions towards the project are welcome, including bug fixes and new features. Leaf LIMS is designed to be deployed easily to any system that supports Docker. Docker provides a light-weight virtualised environment which reduces issues with conflicting dependencies and allows the various components to be deployed with minimal technical knowledge ADDIN EN.CITE <EndNote><Cite><Author>Dirk</Author><Year>2014</Year><IDText>Docker: lightweight Linux containers for consistent development and deployment</IDText><DisplayText><style face="italic superscript">21</style></DisplayText><record><isbn>1075-3583</isbn><custom1>2600241</custom1><titles><title>Docker: lightweight Linux containers for consistent development and deployment</title><secondary-title>Linux J.</secondary-title></titles><pages>2</pages><number>239</number><contributors><authors><author>Dirk Merkel</author></authors></contributors><added-date format="utc">1488461445</added-date><ref-type name="Journal Article">17</ref-type><dates><year>2014</year></dates><rec-number>2</rec-number><last-updated-date format="utc">1488461445</last-updated-date><volume>2014</volume></record></Cite></EndNote>21. The Docker images are available for use on Docker hub and can be integrated into other deployments. Using the docker-compose tool ADDIN EN.CITE <EndNote><Cite><Author>Docker Inc</Author><Year>2017</Year><IDText>Docker Compose</IDText><DisplayText><style face="italic superscript">22</style></DisplayText><record><urls><related-urls><url> Compose</title></titles><number>02/03/2017</number><contributors><authors><author>Docker Inc,</author></authors></contributors><added-date format="utc">1488467101</added-date><pub-location> name="Web Page">12</ref-type><dates><year>2017</year></dates><rec-number>3</rec-number><last-updated-date format="utc">1488469413</last-updated-date></record></Cite></EndNote>22 it is possible to indicate the required pre-requisites (e.g. Postgres 9.5) and have Docker automatically fetch the required versions and run them in the configuration required. Any updates (including adding in more prerequisites) simply require stopping the existing instance and re-running docker-compose to upgrade to the newest version. Configuration is kept minimal with only minor editing of files required. The back-end Docker image uses Daphne as a server to provide the API while the front-end uses the Nginx server to serve the HTML content and reverse proxy API requests to the back-end.Leaf LIMS may be used to communicate sensitive data (e.g. client confidential) and therefore mandates to the use of secure socket technology. When deploying, an SSL certificate is required and all connections are automatically routed to the secure port on the webserver.Author InformationPlease address all correspondence to Thomas Craig or Mark Caddick at GeneMill, Merseybio Building, University of Liverpool, Crown Street, L69 7ZB.Author ContributionThomas Craig wrote most of the system presented as well as writing and collating revisions to the manuscript. Richard Holland wrote the test code, alert code and contributed heavily to code clean up. James Johnson and Rosalinda D’Amore came up with the initial brief for the system as well as contributing to editing the manuscript. Hannah V. McCue, Anthony West, Valentin Zulkower and Hille Tekotte provided testing, contributions of ideas and manuscript editing. Daniel Swan and Robert P Davey managed the project at Earlham with Robert managing Richard Holland during his time on the project. Robert P Davey, Yizhi Cai, Christiane Hertz-Fowler, Anthony Hall and Mark Caddick were PIs on the project and provided support, financing and helped to write and edit the manuscript. AcknowledgementsWe would like to thank the BBSRC for funding this work (BB/M00094X/1; BB/M025659/1; BB/M025640/1). We would also like to thank our colleagues within GeneMill, the Earlham institute and the Edinburgh Genome Foundry for providing valuable insight and support and Richard Holland for developing the test suite, alert framework and code clean up. Bibliography ADDIN EN.REFLIST [1] Engler, C., Youles, M., Gruetzner, R., Ehnert, T. M., Werner, S., Jones, J. D., Patron, N. J., and Marillonnet, S. (2014) A golden gate modular cloning toolbox for plants, ACS Synth Biol 3, 839-843.[2] Lee, M. E., DeLoache, W. C., Cervantes, B., and Dueber, J. E. (2015) A Highly Characterized Yeast Toolkit for Modular, Multipart Assembly, ACS Synth Biol 4, 975-986.[3] Casini, A., Storch, M., Baldwin, G. S., and Ellis, T. (2015) Bricks and blueprints: methods and standards for DNA assembly, Nat Rev Mol Cell Biol 16, 568-576.[4] Smanski, M. J., Bhatia, S., Zhao, D., Park, Y., B A Woodruff, L., Giannoukos, G., Ciulla, D., Busby, M., Calderon, J., Nicol, R., Gordon, D. B., Densmore, D., and Voigt, C. A. (2014) Functional optimization of gene clusters by combinatorial design and assembly, Nat Biotechnol 32, 1241-1249.[5] Kanigowska, P., Shen, Y., Zheng, Y., Rosser, S., and Cai, Y. (2016) Smart DNA Fabrication Using Sound Waves: Applying Acoustic Dispensing Technologies to Synthetic Biology, J Lab Autom 21, 49-56.[6] Earlham Institute. (2017) MISO (Managing Information for Sequencing Operations), .[7] Savitsky, M., Diprose, J. M., Morris, C., Griffiths, S. L., Daniel, E., Lin, B., Daenke, S., Bishop, B., Siebold, C., Wilson, K. S., Blake, R., Stuart, D. I., and Esnouf, R. M. (2011) Recording information on protein complexes in an information management system, J Struct Biol 175, 224-229.[8] Johnson, J. R., D'Amore, R., Thain, S. C., Craig, T., McCue, H. V., Hertz-Fowler, C., Hall, N., and Hall, A. J. (2016) GeneMill: A 21st century platform for innovation, Biochem Soc Trans 44, 681-683.[9] GenoLogics. (2017) BaseSpace Clarity LIMS, .[10] Barillari, C., Ottoz, D. S., Fuentes-Serna, J. M., Ramakrishnan, C., Rinn, B., and Rudolf, F. (2016) openBIS ELN-LIMS: an open-source database for academic laboratories, Bioinformatics 32, 638-640.[11] BikaLabs. (2017) BikaLIMS, .[12] Atlassian. (2017) JIRA Software - Issue and Project Tracking for Software Teams, .[13] Synthace. (2017) About Antha, Synthace, .[14] Ham, T. S., Dmytriv, Z., Plahar, H., Chen, J., Hillson, N. J., and Keasling, J. D. (2012) Design, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools, Nucleic Acids Res 40, e141.[15] Stracquadanio, G., Yang, K., Boeke, J. D., and Bader, J. S. (2016) BioPartsDB: a synthetic biology workflow web-application for education and research, Bioinformatics 32, 3519-3521.[16] . (2017) What is Salesforce?, .[17] Beal, J., Cox, R. S., Grünberg, R., McLaughlin, J., Nguyen, T., Bartley, B., Bissell, M., Choi, K., Clancy, K., Macklin, C., Madsen, C., Misirli, G., Oberortner, E., Pocock, M., Roehner, N., Samineni, M., Zhang, M., Zhang, Z., Zundel, Z., Gennari, J., Myers, C., Sauro, H., and Wipat, A. (2016) Synthetic Biology Open Language (SBOL) Version 2.1.0, J Integr Bioinform 13, 291.[18] Benson, D. A., Cavanaugh, M., Clark, K., Karsch-Mizrachi, I., Lipman, D. J., Ostell, J., and Sayers, E. W. (2013) GenBank, Nucleic Acids Res 41, D36-42.[19] Software, T. (2017) Mosaic, .[20] Olhovsky, M., Williton, K., Dai, A. Y., Pasculescu, A., Lee, J. P., Goudreault, M., Wells, C. D., Park, J. G., Gingras, A. C., Linding, R., Pawson, T., and Colwill, K. (2011) OpenFreezer: a reagent information management software system, Nat Methods 8, 612-613.[21] Merkel, D. (2014) Docker: lightweight Linux containers for consistent development and deployment, Linux J. 2014, 2.[22] Docker Inc. (2017) Docker Compose, . ................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download