问题：

Java正则表达式Matcher.matches函数与整个字符串不匹配

詹弘毅

2023-03-14

我正在尝试将整个字符串与正则表达式匹配，但即使整个字符串不匹配，Matcher.match 函数也会返回 true。

import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class Example {
    public static void main(String[] args) {
        final String string = "\"query1\" \"query2\" \"query3\"";
       // Unescaped Pattern: (\+?".*?[^\\]")(\s+[aA][nN][dD]\s+\+?".*?[^\\]")* 
       final Pattern QPATTERN = Pattern.compile("(\\+?\".*?[^\\\\]\")(\\s+[aA][nN][dD]\\s+\\+?\".*?[^\\\\]\")*", Pattern.MULTILINE);
        Matcher matcher = QPATTERN.matcher(string);
      
        System.out.println(matcher.matches());
        matcher = QPATTERN.matcher(string);  
        while (matcher.find()) {
            System.out.println("Full match: " + matcher.group(0));
            
            for (int i = 1; i <= matcher.groupCount(); i++) {
                System.out.println("Group " + i + ": " + matcher.group(i));
            }
        }
    }
}

您可以从while循环中看到正则表达式只匹配字符串“query1”、“query2”和“query3”的一部分，而不匹配整个字符串。然而， matcher.matches（）返回true。

我哪里出错了？

我也检查了https://regex101.com/的图案，整个字符串都不匹配。

共有2个答案

康锦

2023-03-14

使用grouping（）时，匹配项将被分成组，因此您永远不会将整个字符串放在一组中。正则表达式本身看起来不错，但可能需要一些调整。此线程可能会帮助您：Regex查找所有匹配项

我也是新来的，所以很抱歉不能提供更多的帮助。

彭硕

2023-03-14

< code>matches()方法返回true，因为它需要完整的字符串匹配。您说您在regex101.com上测试了正则表达式，但是您忘记添加锚来模拟< code>matches()行为。

请参阅正则表达式证明您的正则表达式匹配整个字符串。

如果你想停止用这个表达式匹配整个字符串，不要使用 .*？，这个模式可以匹配很多。

用

(?s)(\+?\"[^\"\\]*(?:\\.[^\"\\]*)*\")(\s+[aA][nN][dD]\s+\+?\"[^\"\\]*(?:\\.[^\"\\]*)*\")*

转义版本:

String regex = "(?s)(\\+?\"[^\"\\\\]*(?:\\\\.[^\"\\\\]*)*\")(\\s+[aA][nN][dD]\\s+\\+?\"[^\"\\\\]*(?:\\\\.[^\"\\\\]*)*\")*";

说明

--------------------------------------------------------------------------------
  (?s)                     set flags for this block (with . matching
                           \n) (case-sensitive) (with ^ and $
                           matching normally) (matching whitespace
                           and # normally)
--------------------------------------------------------------------------------
  (                        group and capture to \1:
--------------------------------------------------------------------------------
    \+?                      '+' (optional (matching the most amount
                             possible))
--------------------------------------------------------------------------------
    \"                       '"'
--------------------------------------------------------------------------------
    [^\"\\]*                 any character except: '\"', '\\' (0 or
                             more times (matching the most amount
                             possible))
--------------------------------------------------------------------------------
    (?:                      group, but do not capture (0 or more
                             times (matching the most amount
                             possible)):
--------------------------------------------------------------------------------
      \\                       '\'
--------------------------------------------------------------------------------
      .                        any character
--------------------------------------------------------------------------------
      [^\"\\]*                 any character except: '\"', '\\' (0 or
                               more times (matching the most amount
                               possible))
--------------------------------------------------------------------------------
    )*                       end of grouping
--------------------------------------------------------------------------------
    \"                       '"'
--------------------------------------------------------------------------------
  )                        end of \1
--------------------------------------------------------------------------------
  (                        group and capture to \2 (0 or more times
                           (matching the most amount possible)):
--------------------------------------------------------------------------------
    \s+                      whitespace (\n, \r, \t, \f, and " ") (1
                             or more times (matching the most amount
                             possible))
--------------------------------------------------------------------------------
    [aA]                     any character of: 'a', 'A'
--------------------------------------------------------------------------------
    [nN]                     any character of: 'n', 'N'
--------------------------------------------------------------------------------
    [dD]                     any character of: 'd', 'D'
--------------------------------------------------------------------------------
    \s+                      whitespace (\n, \r, \t, \f, and " ") (1
                             or more times (matching the most amount
                             possible))
--------------------------------------------------------------------------------
    \+?                      '+' (optional (matching the most amount
                             possible))
--------------------------------------------------------------------------------
    \"                       '"'
--------------------------------------------------------------------------------
    [^\"\\]*                 any character except: '\"', '\\' (0 or
                             more times (matching the most amount
                             possible))
--------------------------------------------------------------------------------
    (?:                      group, but do not capture (0 or more
                             times (matching the most amount
                             possible)):
--------------------------------------------------------------------------------
      \\                       '\'
--------------------------------------------------------------------------------
      .                        any character
--------------------------------------------------------------------------------
      [^\"\\]*                 any character except: '\"', '\\' (0 or
                               more times (matching the most amount
                               possible))
--------------------------------------------------------------------------------
    )*                       end of grouping
--------------------------------------------------------------------------------
    \"                       '"'
--------------------------------------------------------------------------------
  )*                       end of \2 (NOTE: because you are using a
                           quantifier on this capture, only the LAST
                           repetition of the captured pattern will be
                           stored in \2)

Java正则表达式Matcher.matches函数与整个字符串不匹配

共有2个答案

相关问答

相关文章

相关阅读

相关工具

相关文档