xref: /illumos-gate/usr/src/man/man3c/glob.3c (revision bbf21555)
Copyright (c) 1989, 1991, 1993, 1994
The Regents of the University of California. All rights reserved.

This code is derived from software contributed to Berkeley by
Guido van Rossum.
Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions
are met:
1. Redistributions of source code must retain the above copyright
notice, this list of conditions and the following disclaimer.
2. Redistributions in binary form must reproduce the above copyright
notice, this list of conditions and the following disclaimer in the
documentation and/or other materials provided with the distribution.
3. Neither the name of the University nor the names of its contributors
may be used to endorse or promote products derived from this software
without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
ARE DISCLAIMED. IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
SUCH DAMAGE.

Sun Microsystems, Inc. gratefully acknowledges The Open Group for
permission to reproduce portions of its copyrighted documentation.
Original documentation from The Open Group can be obtained online at
http://www.opengroup.org/bookstore/.

The Institute of Electrical and Electronics Engineers and The Open
Group, have given us permission to reprint portions of their
documentation.

In the following statement, the phrase ``this text'' refers to portions
of the system documentation.

Portions of this text are reprinted and reproduced in electronic form
in the SunOS Reference Manual, from IEEE Std 1003.1, 2004 Edition,
Standard for Information Technology -- Portable Operating System
Interface (POSIX), The Open Group Base Specifications Issue 6,
Copyright (C) 2001-2004 by the Institute of Electrical and Electronics
Engineers, Inc and The Open Group. In the event of any discrepancy
between these versions and the original IEEE and The Open Group
Standard, the original IEEE and The Open Group Standard is the referee
document. The original Standard can be obtained online at
http://www.opengroup.org/unix/online.html.

This notice shall appear on any product containing this material.

The contents of this file are subject to the terms of the
Common Development and Distribution License (the "License").
You may not use this file except in compliance with the License.

You can obtain a copy of the license at usr/src/OPENSOLARIS.LICENSE
or http://www.opensolaris.org/os/licensing.
See the License for the specific language governing permissions
and limitations under the License.

When distributing Covered Code, include this CDDL HEADER in each
file and include the License file at usr/src/OPENSOLARIS.LICENSE.
If applicable, add the following below this CDDL HEADER, with the
fields enclosed by brackets "[]" replaced with your own identifying
information: Portions Copyright [yyyy] [name of copyright owner]


Copyright (c) 1992, X/Open Company Limited. All Rights Reserved.
Portions Copyright (c) 2003, Sun Microsystems, Inc. All Rights Reserved.
Portions Copyright (c) 2013, Gary Mills

GLOB 3C "Nov 1, 2003"
NAME
glob, globfree - generate path names matching a pattern
SYNOPSIS

#include <glob.h>

int glob(const char *restrict pattern, int flags,
 int(*errfunc)(const char *epath, int eerrno),
 glob_t *restrict pglob);

void globfree(glob_t *pglob);
DESCRIPTION

The glob() function is a path name generator.

The globfree() function frees any memory allocated by glob() associated with pglob.

"pattern Argument"

The argument pattern is a pointer to a path name pattern to be expanded. The glob() function matches all accessible path names against this pattern and develops a list of all path names that match. In order to have access to a path name, glob() requires search permission on every component of a path except the last, and read permission on each directory of any filename component of pattern that contains any of the following special characters:

* ? [
"pglob Argument"

The structure type glob_t is defined in the header <glob.h> and includes at least the following members:

size_t gl_pathc; /* Total count of paths matched by */
 /* pattern */
char **gl_pathv; /* List of matched path names */
size_t gl_offs; /* # of slots reserved in gl_pathv */
int gl_matchc; /* Count of paths matching pattern. */
int gl_flags; /* Copy of flags parameter to glob. */

The glob() function stores the number of matched path names into pglob->gl_pathc and a pointer to a list of pointers to path names into pglob->gl_pathv. The path names are in sort order as defined by the current setting of the LC_COLLATE category. The first pointer after the last path name is a NULL pointer. If the pattern does not match any path names, the returned number of matched paths is set to 0, and the contents of pglob->gl_pathv are implementation-dependent.

It is the caller's responsibility to create the structure pointed to by pglob. The glob() function allocates other space as needed, including the memory pointed to by gl_pathv. The globfree() function frees any space associated with pglob from a previous call to glob().

"flags Argument"

The flags argument is used to control the behavior of glob(). The value of flags is a bitwise inclusive OR of zero or more of the following constants, which are defined in the header <glob.h>: GLOB_APPEND

Append path names generated to the ones from a previous call to glob().

GLOB_DOOFFS

Make use of pglob->gl_offs. If this flag is set, pglob->gl_offs is used to specify how many NULL pointers to add to the beginning of pglob->gl_pathv. In other words, pglob->gl_pathv will point to pglob->gl_offs NULL pointers, followed by pglob->gl_pathc path name pointers, followed by a NULL pointer.

GLOB_ERR

Causes glob() to return when it encounters a directory that it cannot open or read. Ordinarily, glob() continues to find matches.

GLOB_MARK

Each path name that is a directory that matches pattern has a slash appended.

GLOB_NOCHECK

If pattern does not match any path name, then glob() returns a list consisting of only pattern, and the number of matched path names is 1.

GLOB_NOESCAPE

Disable backslash escaping.

GLOB_NOSORT

Ordinarily, glob() sorts the matching path names according to the current setting of the LC_COLLATE category. When this flag is used the order of path names returned is unspecified.

GLOB_ALTDIRFUNC

The following additional fields in the pglob structure have been initialized with alternate functions for glob() to use to open, read, and close directories and to get stat information on names found in those directories:

void *(*gl_opendir)(const char *);
struct dirent *(*gl_readdir)(void *);
void (*gl_closedir)(void *);
int (*gl_lstat)(const char *, struct stat *);
int (*gl_stat)(const char *, struct stat *);
This extension is provided to allow programs such as ufsrestore(8) to provide globbing from directories stored on tape.
GLOB_BRACE

Pre-process the pattern string to expand `{pat,pat,...}' strings like csh(1). The pattern `{}' is left unexpanded for historical reasons. (csh(1) does the same thing to ease typing of find(1) patterns.)

GLOB_MAGCHAR

Set by the glob() function if the pattern included globbing characters. See the description of the usage of the gl_matchc structure member for more details.

GLOB_NOMAGIC

Is the same as GLOB_NOCHECK but it only appends the pattern if it does not contain any of the special characters `*', `?', or `['. GLOB_NOMAGIC is provided to simplify implementing the historic csh(1) globbing behavior and should probably not be used anywhere else.

GLOB_QUOTE

This option has no effect and is included for backwards compatibility with older sources.

GLOB_TILDE

Expand patterns that start with `~' to user name home directories.

GLOB_LIMIT

Limit the amount of memory used by matches to ARG_MAX. This option should be set for programs that can be coerced to a denial of service attack via patterns that expand to a very large number of matches, such as a long string of `*/../*/..'.

GLOB_KEEPSTAT

Retain a copy of the stat(2) information retrieved for matching paths in the gl_statv array:

struct stat **gl_statv;
This option may be used to avoid lstat(2) lookups in cases where they are expensive.

The GLOB_APPEND flag can be used to append a new set of path names to those found in a previous call to glob(). The following rules apply when two or more calls to glob() are made with the same value of pglob and without intervening calls to globfree():

1. The first such call must not set GLOB_APPEND. All subsequent calls must set it.

2. All the calls must set GLOB_DOOFFS, or all must not set it.

3. After the second call, pglob->gl_pathv points to a list containing the following:

a. Zero or more NULL pointers, as specified by GLOB_DOOFFS and pglob->gl_offs.

b. Pointers to the path names that were in the pglob->gl_pathv list before the call, in the same order as before.

c. Pointers to the new path names generated by the second call, in the specified order.

4. The count returned in pglob->gl_pathc will be the total number of path names from the two calls.

5. The application can change any of the fields after a call to glob(). If it does, it must reset them to the original value before a subsequent call, using the same pglob value, to globfree() or glob() with the GLOB_APPEND flag.

"errfunc and epath Arguments"

If, during the search, a directory is encountered that cannot be opened or read and errfunc is not a NULL pointer, glob() calls (*errfunc) with two arguments:

1. The epath argument is a pointer to the path that failed.

2. The eerrno argument is the value of errno from the failure, as set by the opendir(3C), readdir(3C) or stat(2) functions. (Other values may be used to report other errors not explicitly documented for those functions.)

If (*errfunc) is called and returns non-zero, or if the GLOB_ERR flag is set in flags, glob() stops the scan and returns GLOB_ABORTED after setting gl_pathc and gl_pathv in pglob to reflect the paths already scanned. If GLOB_ERR is not set and either errfunc is a NULL pointer or (*errfunc) returns 0, the error is ignored.

RETURN VALUES

On successful completion, glob() returns zero. In addition the fields of pglob contain the values described below: gl_pathc

Contains the total number of matched pathnames so far. This includes other matches from previous invocations of glob() if GLOB_APPEND was specified.

gl_matchc

Contains the number of matched pathnames in the current invocation of glob().

gl_flags

Contains a copy of the flags parameter with the bit GLOB_MAGCHAR set if pattern contained any of the special characters `*', `?', or `[', cleared if not.

gl_pathv

Contains a pointer to a null-terminated list of matched pathnames. However, if gl_pathc is zero, the contents of gl_pathv are undefined.

gl_statv

If the GLOB_KEEPSTAT flag was set, gl_statv contains a pointer to a null-terminated list of matched stat(2) objects corresponding to the paths in gl_pathc.

If glob() terminates due to an error, it sets errno and returns one of the following non-zero constants. defined in <glob.h>: GLOB_ABORTED

The scan was stopped because GLOB_ERR was set or (*errfunc) returned non-zero.

GLOB_NOMATCH

The pattern does not match any existing path name, and GLOB_NOCHECK was not set in flags.

GLOB_NOSPACE

An attempt to allocate memory failed.

GLOB_NOSYS

The requested function is not supported by this version of glob().

The arguments pglob->gl_pathc and pglob->gl_pathv are still set as specified above.

The globfree() function returns no value.

USAGE

This function is not provided for the purpose of enabling utilities to perform path name expansion on their arguments, as this operation is performed by the shell, and utilities are explicitly not expected to redo this. Instead, it is provided for applications that need to do path name expansion on strings obtained from other sources, such as a pattern typed by a user or read from a file.

If a utility needs to see if a path name matches a given pattern, it can use fnmatch(3C).

Note that gl_pathc and gl_pathv have meaning even if glob() fails. This allows glob() to report partial results in the event of an error. However, if gl_pathc is 0, gl_pathv is unspecified even if glob() did not return an error.

The GLOB_NOCHECK option could be used when an application wants to expand a path name if wildcards are specified, but wants to treat the pattern as just a string otherwise.

The new path names generated by a subsequent call with GLOB_APPEND are not sorted together with the previous path names. This mirrors the way that the shell handles path name expansion when multiple expansions are done on a command line.

Applications that need tilde and parameter expansion should use the wordexp(3C) function.

EXAMPLES

Example 1 Example of glob_doofs function.

One use of the GLOB_DOOFFS flag is by applications that build an argument list for use with the execv(), execve(), or execvp() functions (see exec(2)). Suppose, for example, that an application wants to do the equivalent of:

ls -l *.c

but for some reason:

system("ls -l *.c")

is not acceptable. The application could obtain approximately the same result using the sequence:

globbuf.gl_offs = 2;
glob ("*.c", GLOB_DOOFFS, NULL, &globbuf);
globbuf.gl_pathv[0] = "ls";
globbuf.gl_pathv[1] = "-l";
execvp ("ls", &globbuf.gl_pathv[0]);

Using the same example:

ls -l *.c *.h

could be approximately simulated using GLOB_APPEND as follows:

globbuf.gl_offs = 2;
glob ("*.c", GLOB_DOOFFS, NULL, &globbuf);
glob ("*.h", GLOB_DOOFFS|GLOB_APPEND, NULL, &globbuf);
.\|.\|.
ATTRIBUTES

See attributes(7) for descriptions of the following attributes:

ATTRIBUTE TYPE ATTRIBUTE VALUE
Interface Stability Standard
MT-Level MT-Safe
SEE ALSO

execv (2), stat (2), fnmatch (3C), opendir (3C), readdir (3C), wordexp (3C), attributes (7), standards (7)