Disclosed is a filing apparatus that improves operability and work efficiency when multimedia data prepared by another application is fitted into data being prepared. A word processor sends a request containing a predetermined code and a keyword to the filing apparatus, which judges whether the received code is the predetermined code. If it is, files containing the keyword are retrieved. The number of retrieved multimedia data items is then judged: if plural items are retrieved, a retrieval image frame is displayed for selecting one of them, whereas if only one item is retrieved, it is fitted directly into the word-processed text.
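The request handling described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the names `PREDETERMINED_CODE`, `MediaFile`, `retrieve`, and `handle_request` are hypothetical, the code value is invented, and the `choose` callback merely stands in for the retrieval image frame. The abstract does not say what happens when the code does not match or when nothing is retrieved, so returning `None` in those cases is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, List, Optional, Sequence

# Hypothetical value: the abstract does not specify what the code looks like.
PREDETERMINED_CODE = "FIT_REQUEST"


@dataclass(frozen=True)
class MediaFile:
    """A stored multimedia item with the keywords filed alongside it."""
    name: str
    keywords: FrozenSet[str]


def retrieve(files: Sequence[MediaFile], keyword: str) -> List[MediaFile]:
    """Return every stored file whose keyword set contains the keyword."""
    return [f for f in files if keyword in f.keywords]


def handle_request(
    code: str,
    keyword: str,
    files: Sequence[MediaFile],
    choose: Callable[[List[MediaFile]], MediaFile],
) -> Optional[MediaFile]:
    """Return the item to fit into the text, or None if there is none."""
    # Step 1: judge whether the received code is the predetermined code.
    if code != PREDETERMINED_CODE:
        return None  # assumption: other requests are not handled here

    # Step 2: retrieve the files containing the keyword.
    hits = retrieve(files, keyword)

    # Step 3: judge the number of retrieved items.
    if not hits:
        return None  # assumption: the abstract does not cover this case
    if len(hits) == 1:
        return hits[0]  # a single hit is fitted directly, with no selection
    return choose(hits)  # plural hits: the user selects one
```

Passing the selection step as a callback keeps the branching logic separate from any display code, so the single-hit shortcut can be exercised without a user interface.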