Application of text-mining techniques for extraction and analysis of paracetamol and ibuprofen marketed products’ qualitative composition
Primena tehnika za sistematizovanu obradu tekstualnih informacija u cilju analize kvalitativnog sastava registrovanih preparata paracetamola i ibuprofena
Article (Published version)
Metadata
Show full item recordAbstract
Text mining (TM) applications in the field of biomedicine are gaining great interest. TM
tools can facilitate formulation development by analyzing textual information from patent
databases, scientific articles, summary of products characteristics, etc. The aim of this study was
to utilize TM tools to perform qualitative analysis of paracetamol (PAR) and ibuprofen (IBU)
formulations, in terms of identifying and evaluating the presence of excipients specific to the
active pharmaceutical ingredient (API) and/or dosage form. A total of 152 products were
analyzed. Web-scraping was used to retrieve the data, and Python-based open-source software
Orange 3.31.1 was used for TM and statistical analysis (ANOVA) of the obtained results. The
majority of marketed products for both APIs were tablets. The predominant excipients in all tablet
formulations were povidone, starch, microcrystalline cellulose and hypromellose. Povidone,
stearic acid, potassium sorbate, maize starch and pregelatin...ized starch occurred more frequently
in PAR tablets. On the other hand, titanium dioxide, lactose, shellac, sucrose and ammonium
hydroxide were specific to IBU tablets. PAR oral suspensions more frequently contained
dispersible cellulose; liquid sorbitol; methyl and propyl parahydroxybenzoate, glycerol and
acesulfame potassium. Specific excipients in other PAR dosage forms, such as effervescent
tablets, hard capsules, oral powders, solutions and suspensions, as well as IBU gels and soft
capsules, were also evaluated.
Primena text mining (TM) alata u oblasti biomedicine postaje sve značajnija. TM alati mogu da olakšaju razvoj formulacija, tako što omogućavaju analizu tekstualnih informacija iz patentnih baza, naučnih članaka, sažetaka karakteristika lekova, itd. Cilj ovog rada bila je primena TM alata za kvalitativnu analizu formulacija paracetamola (PAR) i ibuprofena (IBU), u smislu identifikacije i procene prisustva ekscipijenasa koji su karakteristični za lekovitu supstancu i/ili farmaceutski oblik. Ukupno je analiziran sastav 152 preparata. Web-scraping je primenjen za prikupljanje podataka, a Orange 3.31.1, softver otvorenog koda zasnovan na programskom jeziku Python, primenjen je za TM i statističku analizu (ANOVA) dobijenih rezultata. Većina analiziranih formulacija za obe lekovite supstance bile su tablete, a najzastupljeniji ekscipijensi u njima su bili povidon, skrob, mikrokristalna celuloza i hipromeloza. Povidon, stearinska kiselina, kalijum sorbat, kukuruzni skrob i pregelirani skrob se... češće pronalaze u formulacijama PAR tableta. Titanijum-dioksid, laktoza, šelak, saharoza i amonijum hidroksid su specifični za IBU tablete. PAR peroralne suspenzije su češće sadržale disperzibilnu celulozu; tečni sorbitol; metil-i propil parahidroksibenzoat, glicerol i acesulfam-kalijum. Takođe su identifikovani i specifični ekscipijensi za PAR efervescentne tablete, tvrde kapsule, peroralne praškove, rastvore i suspenzije, kao i za IBU gelove i meke kapsule.
Keywords:
text mining / dosage forms / qualitative analysis / excipients / paracetamol / ibuprofen / farmaceutski oblici / kvalitativna analiza / ekscipijensiSource:
Arhiv za farmaciju, 2022, 72, 6, 689-700Publisher:
- Pharmaceutical Association of Serbia
Funding / projects:
Collections
Institution/Community
PharmacyTY - JOUR AU - Đuriš, Jelena AU - Pilović, Jovana AU - Džunić, Marina AU - Cvijić, Sandra AU - Ibrić, Svetlana PY - 2022 UR - https://farfar.pharmacy.bg.ac.rs/handle/123456789/4412 AB - Text mining (TM) applications in the field of biomedicine are gaining great interest. TM tools can facilitate formulation development by analyzing textual information from patent databases, scientific articles, summary of products characteristics, etc. The aim of this study was to utilize TM tools to perform qualitative analysis of paracetamol (PAR) and ibuprofen (IBU) formulations, in terms of identifying and evaluating the presence of excipients specific to the active pharmaceutical ingredient (API) and/or dosage form. A total of 152 products were analyzed. Web-scraping was used to retrieve the data, and Python-based open-source software Orange 3.31.1 was used for TM and statistical analysis (ANOVA) of the obtained results. The majority of marketed products for both APIs were tablets. The predominant excipients in all tablet formulations were povidone, starch, microcrystalline cellulose and hypromellose. Povidone, stearic acid, potassium sorbate, maize starch and pregelatinized starch occurred more frequently in PAR tablets. On the other hand, titanium dioxide, lactose, shellac, sucrose and ammonium hydroxide were specific to IBU tablets. PAR oral suspensions more frequently contained dispersible cellulose; liquid sorbitol; methyl and propyl parahydroxybenzoate, glycerol and acesulfame potassium. Specific excipients in other PAR dosage forms, such as effervescent tablets, hard capsules, oral powders, solutions and suspensions, as well as IBU gels and soft capsules, were also evaluated. AB - Primena text mining (TM) alata u oblasti biomedicine postaje sve značajnija. TM alati mogu da olakšaju razvoj formulacija, tako što omogućavaju analizu tekstualnih informacija iz patentnih baza, naučnih članaka, sažetaka karakteristika lekova, itd. Cilj ovog rada bila je primena TM alata za kvalitativnu analizu formulacija paracetamola (PAR) i ibuprofena (IBU), u smislu identifikacije i procene prisustva ekscipijenasa koji su karakteristični za lekovitu supstancu i/ili farmaceutski oblik. Ukupno je analiziran sastav 152 preparata. Web-scraping je primenjen za prikupljanje podataka, a Orange 3.31.1, softver otvorenog koda zasnovan na programskom jeziku Python, primenjen je za TM i statističku analizu (ANOVA) dobijenih rezultata. Većina analiziranih formulacija za obe lekovite supstance bile su tablete, a najzastupljeniji ekscipijensi u njima su bili povidon, skrob, mikrokristalna celuloza i hipromeloza. Povidon, stearinska kiselina, kalijum sorbat, kukuruzni skrob i pregelirani skrob se češće pronalaze u formulacijama PAR tableta. Titanijum-dioksid, laktoza, šelak, saharoza i amonijum hidroksid su specifični za IBU tablete. PAR peroralne suspenzije su češće sadržale disperzibilnu celulozu; tečni sorbitol; metil-i propil parahidroksibenzoat, glicerol i acesulfam-kalijum. Takođe su identifikovani i specifični ekscipijensi za PAR efervescentne tablete, tvrde kapsule, peroralne praškove, rastvore i suspenzije, kao i za IBU gelove i meke kapsule. PB - Pharmaceutical Association of Serbia T2 - Arhiv za farmaciju T1 - Application of text-mining techniques for extraction and analysis of paracetamol and ibuprofen marketed products’ qualitative composition T1 - Primena tehnika za sistematizovanu obradu tekstualnih informacija u cilju analize kvalitativnog sastava registrovanih preparata paracetamola i ibuprofena VL - 72 IS - 6 SP - 689 EP - 700 DO - 10.5937/arhfarm72-40397 ER -
@article{ author = "Đuriš, Jelena and Pilović, Jovana and Džunić, Marina and Cvijić, Sandra and Ibrić, Svetlana", year = "2022", abstract = "Text mining (TM) applications in the field of biomedicine are gaining great interest. TM tools can facilitate formulation development by analyzing textual information from patent databases, scientific articles, summary of products characteristics, etc. The aim of this study was to utilize TM tools to perform qualitative analysis of paracetamol (PAR) and ibuprofen (IBU) formulations, in terms of identifying and evaluating the presence of excipients specific to the active pharmaceutical ingredient (API) and/or dosage form. A total of 152 products were analyzed. Web-scraping was used to retrieve the data, and Python-based open-source software Orange 3.31.1 was used for TM and statistical analysis (ANOVA) of the obtained results. The majority of marketed products for both APIs were tablets. The predominant excipients in all tablet formulations were povidone, starch, microcrystalline cellulose and hypromellose. Povidone, stearic acid, potassium sorbate, maize starch and pregelatinized starch occurred more frequently in PAR tablets. On the other hand, titanium dioxide, lactose, shellac, sucrose and ammonium hydroxide were specific to IBU tablets. PAR oral suspensions more frequently contained dispersible cellulose; liquid sorbitol; methyl and propyl parahydroxybenzoate, glycerol and acesulfame potassium. Specific excipients in other PAR dosage forms, such as effervescent tablets, hard capsules, oral powders, solutions and suspensions, as well as IBU gels and soft capsules, were also evaluated., Primena text mining (TM) alata u oblasti biomedicine postaje sve značajnija. TM alati mogu da olakšaju razvoj formulacija, tako što omogućavaju analizu tekstualnih informacija iz patentnih baza, naučnih članaka, sažetaka karakteristika lekova, itd. Cilj ovog rada bila je primena TM alata za kvalitativnu analizu formulacija paracetamola (PAR) i ibuprofena (IBU), u smislu identifikacije i procene prisustva ekscipijenasa koji su karakteristični za lekovitu supstancu i/ili farmaceutski oblik. Ukupno je analiziran sastav 152 preparata. Web-scraping je primenjen za prikupljanje podataka, a Orange 3.31.1, softver otvorenog koda zasnovan na programskom jeziku Python, primenjen je za TM i statističku analizu (ANOVA) dobijenih rezultata. Većina analiziranih formulacija za obe lekovite supstance bile su tablete, a najzastupljeniji ekscipijensi u njima su bili povidon, skrob, mikrokristalna celuloza i hipromeloza. Povidon, stearinska kiselina, kalijum sorbat, kukuruzni skrob i pregelirani skrob se češće pronalaze u formulacijama PAR tableta. Titanijum-dioksid, laktoza, šelak, saharoza i amonijum hidroksid su specifični za IBU tablete. PAR peroralne suspenzije su češće sadržale disperzibilnu celulozu; tečni sorbitol; metil-i propil parahidroksibenzoat, glicerol i acesulfam-kalijum. Takođe su identifikovani i specifični ekscipijensi za PAR efervescentne tablete, tvrde kapsule, peroralne praškove, rastvore i suspenzije, kao i za IBU gelove i meke kapsule.", publisher = "Pharmaceutical Association of Serbia", journal = "Arhiv za farmaciju", title = "Application of text-mining techniques for extraction and analysis of paracetamol and ibuprofen marketed products’ qualitative composition, Primena tehnika za sistematizovanu obradu tekstualnih informacija u cilju analize kvalitativnog sastava registrovanih preparata paracetamola i ibuprofena", volume = "72", number = "6", pages = "689-700", doi = "10.5937/arhfarm72-40397" }
Đuriš, J., Pilović, J., Džunić, M., Cvijić, S.,& Ibrić, S.. (2022). Application of text-mining techniques for extraction and analysis of paracetamol and ibuprofen marketed products’ qualitative composition. in Arhiv za farmaciju Pharmaceutical Association of Serbia., 72(6), 689-700. https://doi.org/10.5937/arhfarm72-40397
Đuriš J, Pilović J, Džunić M, Cvijić S, Ibrić S. Application of text-mining techniques for extraction and analysis of paracetamol and ibuprofen marketed products’ qualitative composition. in Arhiv za farmaciju. 2022;72(6):689-700. doi:10.5937/arhfarm72-40397 .
Đuriš, Jelena, Pilović, Jovana, Džunić, Marina, Cvijić, Sandra, Ibrić, Svetlana, "Application of text-mining techniques for extraction and analysis of paracetamol and ibuprofen marketed products’ qualitative composition" in Arhiv za farmaciju, 72, no. 6 (2022):689-700, https://doi.org/10.5937/arhfarm72-40397 . .