SFILES 2.0: An extended text-based flowsheet representation

by   Gabriel Vogel, et al.

SFILES is a text-based notation for chemical process flowsheets. It was originally proposed by d'Anterroches (2006) who was inspired by the text-based SMILES notation for molecules. The text-based format has several advantages compared to flowsheet images regarding the storage format, computational accessibility, and eventually for data analysis and processing. However, the original SFILES version cannot describe essential flowsheet configurations unambiguously, such as the distinction between top and bottom products. Neither is it capable of describing the control structure required for the safe and reliable operation of chemical processes. Also, there is no publicly available software for decoding or encoding chemical process topologies to SFILES. We propose the SFILES 2.0 with a complete description of the extended notation and naming conventions. Additionally, we provide open-source software for the automated conversion between flowsheet graphs and SFILES 2.0 strings. This way, we hope to encourage researchers and engineers to publish their flowsheet topologies as SFILES 2.0 strings. The ultimate goal is to set the standards for creating a FAIR database of chemical process flowsheets, which would be of great value for future data analysis and processing.


page 1

page 2

page 3

page 4


Learning from flowsheets: A generative transformer model for autocompletion of flowsheets

We propose a novel method enabling autocompletion of chemical flowsheets...

The Value and Use of Data in Chemical Engineering Practice

The ability to generate, organize, analyze, understand and leverage data...

Unicode at Gigabytes per Second

We often represent text using Unicode formats (UTF-8 and UTF-16). The UT...

Group SELFIES: A Robust Fragment-Based Molecular String Representation

We introduce Group SELFIES, a molecular string representation that lever...

Chemical Names Standardization using Neural Sequence to Sequence Model

Chemical information extraction is to convert chemical knowledge in text...

A Quantum Finite Automata Approach to Modeling the Chemical Reactions

In recent years, the modeling interest has increased significantly from ...

Please sign up or login with your details

Forgot password? Click here to reset