Please use this identifier to cite or link to this item: https://elibrary.khec.edu.np:8080/handle/123456789/189
Full metadata record
DC FieldValueLanguage
dc.contributor.authorShrestha, Shiva Kumar-
dc.contributor.authorJoshi, Shashidhar Ram-
dc.date.accessioned2022-05-02T11:50:08Z-
dc.date.available2022-05-02T11:50:08Z-
dc.date.issued2020-09-
dc.identifier.issn2091—1475 (Print)-
dc.identifier.issn2645-8518 (Online)-
dc.identifier.urihttps://doi.org/10.3126/jsce.v8i0.32860-
dc.description.abstractThe process of generating an image that depicts naturalness is not so easy. To address such problem this paper introduces a novel approach to synthesize a photo-realistic image from the caption. The user can adjust the image highlights turn-by-turn according to the caption. This leads to the integration of natural intelligence. For this, the input passed to dialogue state tracker to extract context feature. Then the generator produces an image. If image is not as per expectations then user gives another dialogue, but the system takes both recent input and previous image to generate a new one. In such a manner, user gets a chance to visualize as per the imagination. We performed extensive experiments on two datasets CUB and COCO to generate a realistic image each turn and obtained the results: Inception Score (IS) of 4.38 ± 0.05, R-precision of 67.96 ± 5.27 % on CUB dataset and IS of 26.12 ± 0.24, R-precision of 91.00 ± 2.31 % on COCO dataset. Further, the work could be enhance to synthesize HQ image, voice integration, and video generation from stories and so on. This research is limited to 256x256 image in each turn.en_US
dc.language.isoenen_US
dc.subjectGAN, MultiTurnGAN, Text-to-image, Image generation, Realistic image synthesisen_US
dc.titlePHOTOGRAPHIC TEXT-TO-IMAGE SYNTHESIS VIA MULTI-TURN DIALOGUE USING ATTENTIONAL GANen_US
Appears in Collections:Journal of Science and Engineering Vol.8

Files in This Item:
File Description SizeFormat 
4_Shiva_Kumar_Shrestha.pdf1.32 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.