NCA-GENM試験無料問題集「NVIDIA Generative AI Multimodal 認定」
You are developing a multimodal model that combines time-series data from sensor readings with natural language descriptions of events. The time-series data has varying sampling rates and the text descriptions are often vague and ambiguous. How would you best address the challenge of aligning and fusing these two modalities to improve model performance?
正解:A
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You're using Stable Diffusion with a custom prompt to generate images of landscapes. You notice that the generated images consistently lack detail and appear blurry, despite increasing the number of inference steps. Which of the following prompt engineering techniques, combined with appropriate parameter tuning, is MOST likely to address this issue and improve the image's sharpness and detail?
正解:D
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You have a multimodal model that takes video and audio as input for activity recognition. You want to evaluate the impact of different fusion strategies (early fusion, late fusion, intermediate fusion) on the model's accuracy and computational cost. Which of the following statements is generally TRUE regarding these fusion strategies?
正解:A
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are working on a generative A1 model that creates descriptions of images. During experimentation, you notice the model consistently generates descriptions that are factually incorrect about objects in the image, despite the image quality being high. For example, it might describe a 'cat' as a 'dog'. What is the MOST critical step to address this issue?
正解:C
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
A research team is developing a multimodal model to predict stock prices using financial news articles, company filings (text), historical stock prices (time-series), and executive interviews (audio). They are experiencing significant performance issues due to inconsistent data quality across modalities. What specific strategies would you recommend to address these data quality challenges?
正解:A
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are tasked with visualizing the performance of a Generative A1 model across different categories of input dat a. You need to show both the accuracy and the number of data points in each category. Which visualization technique would be MOST effective for this purpose?
正解:C
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are fine-tuning a large pre-trained language model for a specific downstream task using a limited amount of training dat a. Which of the following techniques is MOST likely to prevent overfitting and improve the model's generalization performance?
正解:C
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are developing a system that generates 3D models from text descriptions. The system currently produces models that are geometrically accurate but lack fine-grained surface details and realistic textures. Which of the following steps would be MOST effective in improving the visual realism of the generated 3D models?
正解:B
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are tasked with optimizing a multimodal A1 model that processes both images and text. You observe significant latency during the image encoding phase using a pre-trained ResNet50 model. Which of the following techniques would be MOST effective in reducing latency while preserving accuracy, considering energy efficiency?
正解:E
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are working on a project that involves analyzing customer reviews which contains the following dataset: 1. customer_id(categorical) 2. customer_review(text) 3. product_image(image) 4. video_of_product_usage(video) What is the best way to handle and address the problem of skewness across each modailities?
正解:A,D,E
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
Consider a multimodal generative A1 model that produces images based on textual prompts. The model is prone to generating images that are similar to those in the training data, resulting in a lack of novelty. Which hyperparameter adjustment would be MOST effective in increasing the diversity of the generated images?
正解:A
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are developing a system to automatically generate image descriptions for visually impaired users. The system uses a combination of object detection, attribute recognition, and relationship extraction. However, the generated descriptions often lack detail and fail to capture the nuances of the image content. Which of the following strategies would MOST effectively address this limitation?
正解:B
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You're developing a multimodal A1 system that takes image data, text descriptions, and user interaction data (clicks, dwell time) to generate personalized product recommendations. To effectively combine these modalities and capture complex relationships, which model architecture would be most suitable?
正解:C
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)
You are fine-tuning a pre-trained language model for a specific task. You notice that the model performs well on the training data but poorly on the validation dat a. Which of the following techniques can help mitigate this overfitting problem? (Select TWO)
正解:A,B
解答を投票する
解説: (GoShiken メンバーにのみ表示されます)