使用JavaMail阅读阿拉伯文附件

宗政兴发

2023-03-14

问题内容：

我使用Java Mail下载阿拉伯文附件文件时遇到问题。

文件名始终是不明确的。

问题是Bodypart检索附件为非UTF-8字符。

private void getAttachments(Message temp) throws IOException, MessagingException {
    List<File> attachments = new ArrayList<File>();

    Multipart multipart = (Multipart) temp.getContent();

    System.out.println(multipart.getCount());

    for (int i = 0; i < multipart.getCount(); i++) {
        BodyPart bodyPart = multipart.getBodyPart(i);
        if (!Part.ATTACHMENT.equalsIgnoreCase(bodyPart.getDisposition())) {
            continue; // dealing with attachments only
        }
        InputStream is = bodyPart.getInputStream();

        // getFilename always have wrong characters set 
        byte [] fileBytes = bodyPart.getFileName().toString().getBytes();

        String filename = new String(fileBytes, "UTF-8");

        File f = new File("C:\\Attachments\\" + filename);

         System.out.println(f .getName());

         try {
        if (f == null) {
            //filename = File.createTempFile("VSX", ".out").getName();
            return;
        }

        FileOutputStream fos = new FileOutputStream(f );
        BufferedOutputStream bos = new BufferedOutputStream(fos);
        BufferedInputStream bis = new BufferedInputStream(is);

        int aByte;
        while ((aByte = bis.read()) >=0) {
            bos.write(aByte);
        }

        fos.flush();
        bos.flush();
        bos.close();
        bis.close();
        fos.close();
    } // end of try()
    catch (IOException exp) {
        System.out.println("IOException:" + exp);
    }

        attachments.add(f);
    }
}

问题答案：

标头是根据RFC 2047中描述的机制（即encoded-word）编码的，该机制表示与=? < encoding> ?B? <
encoded-bytes>_匹配的标头的部分?=是字节编码的部分。所述
< 编码>说如何解释的字节数，和（因为它是B风格，而不是Q样式）的

< 编码字节>_是基64编码。

这都是相当复杂的。幸运的是，您可以使用静态javax.mail.internet.MimeUtility.decodeText()方法轻松处理此问题。这意味着您可以切换到此：

String filename = MimeUtility.decodeText(bodyPart.getFileName());

实际上，最好也将其与下一行结合：

File f = new File("C:\\Attachments",
                  MimeUtility.decodeText(bodyPart.getFileName()));

这样做比较好，因为它避免了构建文件名的麻烦，而不是手工完成所有工作。（这还意味着您可以将该文字路径名分解为某些配置位置。）

使用JavaMail阅读阿拉伯文附件

相关阅读

相关文章

相关问答

相关工具

相关文档