lightr: import spectral data in R

Lifecycle: stable CRAN version R build status Coverage status Reviewed by rOpenSci JOSS paper

There is no standard file format for spectrometry data and different scientific instrumentation companies use wildly different formats to store spectral data. Vendor proprietary software sometimes has an option but convert those formats instead human readable files such as csv but in the process, most metadata are lost. However, those metadata are critical to ensure reproducibility (White et al, 2015).

This package aims at offering a unified user-friendly interface for users to read UV-VIS reflectance/transmittance/absorbance spectra files from various formats in a single line of code.

Additionally, it provides for the first time a fully free and open source solution to read proprietary spectra file formats on all systems.

🗟 Citation

To cite this package in publications, please use:

Gruson H., White T.E., Maia R., (2019). lightr: import spectral data and metadata in R. Journal of Open Source Software, 4(43), 1857, https://doi.org/10.21105/joss.01857

🔧 Installation

install.packages("lightr")

You can also install the development version from rOpenSci’s CRAN-like repository:

install.packages("lightr", repos = "https://dev.ropensci.org")

💻 Usage

A thorough documentation is available with the package, using R usual syntax ?function or help(function). However, users will probably mainly use two functions:

# Get a data.frame containing all useful metadata from spectra in a folder
lr_get_metadata(where = system.file("testdata/procspec_files", 
                                    package = "lightr"), 
                ext = "ProcSpec")

and

# Get a single dataframe where the first column contains the wavelengths and 
# the next columns contain a spectra each (pavo's rspec class)
lr_get_spec(where = system.file("testdata/procspec_files", package = "lightr"),
            ext = "ProcSpec")

lr_get_spec() returns a dataframe that is compatible with pavo custom S3 class (rspec) and can be used for further analyses using colour vision models.

All supported file formats can also be parsed using the lr_parse_$extension() function where $extension is the lowercase extension of your file. This family of functions return a list where the first element is the data dataframe and the second element is a vector with relevant metadata.

Only exceptions are .txt and .Transmission files because those extensions are too generic. Users will need to figure out which parser is appropriate in this case. lr_get_metadata() and lr_get_spec() automatically try generic parsers in this case.

Alternatively, you may simply want to convert your spectra in a readable standard format and carry on with your analysis with another software.

In this case, you can run:

# Convert every single ProcSpec file to a csv file with the same name and 
# location
lr_convert_tocsv(where = system.file("testdata/procspec_files", 
                                      package = "lightr"),
                 ext = "ProcSpec")

✔ Supported file formats

This package is still under development but currently supports (you can click on the extension in the tables to see an example of this file format):

OceanOptics/OceanInsight

Extension Parser
jdx lr_parse_jdx()
ProcSpec lr_parse_procspec()
spc lr_parse_spc()
jaz lr_parse_jaz()
JazIrrad lr_parse_jazirrad()
Transmission lr_parse_jaz()
txt lr_parse_jaz()

Avantes

Extension Parser
ABS lr_parse_abs()
ROH lr_parse_roh()
TRM lr_parse_trm()
trt lr_parse_trt()
ttt lr_parse_ttt()
txt lr_parse_generic()
DRK lr_parse_trm()
REF lr_parse_trm()
IRR8 lr_parse_irr8()
RFL8 lr_parse_rfl8()
Raw8 lr_parse_raw8()

CRAIC

Extension Parser
txt lr_parse_generic()
spc lr_parse_spc()

Others

Extension Parser
csv lr_parse_generic(sep = ",")
dpt lr_parse_generic(sep = ",")

As a fallback, you should always try lr_parse_generic() which offers a flexible and general algorithm that manages to extract data from most files.

If you can’t find the best parser for your specific file or if you believe you are using an unsupported format, please open an issue or send me an email.

🌐 Similar projects

To our knowledge, lightr is the only gratis tool to import some complex file formats such as Avantes (ABS, ROH, TRM, RFL8) or CRAIC (spc) binary files, or OceanOptics .ProcSpec. Because of its user-friendly high-levels functions and low dependency philosophy, lightr may also hopefully prove useful for people working with other languages than R.

Contributing

There are plenty of ways you can contribute to lightr. Please visit our contributing guide.

Please note that this package is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.