Command-line interface¶
After installing chemFilters, the chemfilters command is available for filtering
datasets from the terminal.
Usage¶
chemfilters -i INPUT -o OUTPUT [OPTIONS]
Required arguments¶
Flag |
Description |
|---|---|
|
Path to the input file containing SMILES strings |
|
Path to the output CSV file |
Optional arguments¶
Flag |
Default |
Description |
|---|---|---|
|
|
Column name for CSV input. If not provided, input is treated as a text file with one SMILES per line |
|
enabled |
Apply RDKit structural alert filters |
|
Disable RDKit filters |
|
|
|
Subset of RDKit filters to apply (e.g., |
|
|
Output format: |
|
enabled |
Apply peptide filters (PepSift) |
|
Disable peptide filters |
|
|
enabled |
Apply silly molecule filters (molspotter) |
|
Disable silly molecule filters |
|
|
enabled |
Apply purchasability filters (molbloom) |
|
Disable purchasability filters |
|
|
enabled |
Standardize molecules before filtering |
|
Skip standardization |
|
|
|
Standardization method ( |
|
|
Number of parallel jobs |
|
|
Process SMILES in chunks of this size. Negative means process all at once. Reduce if memory is limited |
Examples¶
Filter a text file of SMILES using all filters:
chemfilters -i molecules.txt -o results.csv
Filter a CSV file, disabling peptide and silly filters:
chemfilters -i data.csv -c smiles_column -o results.csv \
--no-pep-filter --no-silly-filter
Use only PAINS filters with boolean output:
chemfilters -i molecules.txt -o results.csv \
--rdkit-subset PAINS --rdkit-valtype bool \
--no-pep-filter --no-silly-filter --no-bloom-filter