Module Misc
Miscellaneous useful types and functions
Warning: this module is unstable and part of compiler-libs.
Reporting fatal errors
val fatal_errorf : ('a, Stdlib.Format.formatter, unit, 'b) Stdlib.format4 -> 'aFormat the arguments according to the given format string and raise Fatal_error with the resulting string.
Raise a Fatal_error explaining that a slambda splice shouldn't exist in lambda code after slambda eval has happened.
Exceptions and finalization
try_finally work ~always ~exceptionally is designed to run code in work that may fail with an exception, and has two kind of cleanup routines: always, that must be run after any execution of the function (typically, freeing system resources), and exceptionally, that should be run only if work or always failed with an exception (typically, undoing user-visible state changes that would only make sense if the function completes correctly). For example:
let objfile = outputprefix ^ ".cmo" in
let oc = open_out_bin objfile in
Misc.try_finally
(fun () ->
bytecode
++ Timings.(accumulate_time (Generate sourcefile))
(Emitcode.to_file oc modulename objfile);
Warnings.check_fatal ())
~always:(fun () -> close_out oc)
~exceptionally:(fun _exn -> remove_file objfile);If exceptionally fail with an exception, it is propagated as usual.
If always or exceptionally use exceptions internally for control-flow but do not raise, then try_finally is careful to preserve any exception backtrace coming from work or always for easier debugging.
reraise_preserving_backtrace e f is (f (); raise e) except that the current backtrace is preserved, even if f uses exceptions internally.
List operations
map_end f l t is map f l @ t, just more efficient.
rev_map_end f l t is map f (rev l) @ t, just more efficient.
Like List.map, with guaranteed left-to-right evaluation order
Same as List.for_all but for a binary predicate. In addition, this for_all2 never fails: given two lists with different lengths, it returns false.
replicate_list elem n is the list with n elements all identical to elem.
list_remove x l returns a copy of l with the first element equal to x removed.
Return the last element and the other elements of the given list.
Hash table operations
val create_hashtable : int -> ('a * 'b) list -> ('a, 'b) Stdlib.Hashtbl.tCreate a hashtable with the given initial size and fills it with the given bindings.
Extensions to the standard library
module Stdlib : sig ... endOperations on files and file paths
Search a relative file in a list of directories.
Same as find_in_path_rel , but search also for normalized unit filename, i.e. if name is Foo.ml, allow /path/Foo.ml and /path/foo.ml to match.
Delete the given file if it exists and is a regular file. Does nothing for other kinds of files. Never raises an error.
Delete the given directory if it exists, is a directory, and is empty. Never raises an error.
Delete all files in the given directory, then delete the directory itself. Only handles flat directories (no subdirectories). Never raises an error.
expand_directory alt file eventually expands a + at the beginning of file into alt (an alternate root directory)
split_path_contents ?sep s interprets s as the value of a "PATH"-like variable and returns the corresponding list of directories. s is split using the platform-specific delimiter, or ~sep if it is passed.
Returns the empty list if s is empty.
val copy_file : Stdlib.in_channel -> Stdlib.out_channel -> unitcopy_file ic oc reads the contents of file ic and copies them to oc. It stops when encountering EOF on ic.
val copy_file_chunk : Stdlib.in_channel -> Stdlib.out_channel -> int -> unitcopy_file_chunk ic oc n reads n bytes from ic and copies them to oc. It raises End_of_file when encountering EOF on ic.
val string_of_file : Stdlib.in_channel -> stringstring_of_file ic reads the contents of file ic and copies them to a string. It stops when encountering EOF on ic.
val output_to_file_via_temporary :
?mode:Stdlib.open_flag list ->
string ->
(string -> Stdlib.out_channel -> 'a) ->
'aProduce output in temporary file, then rename it (as atomically as possible) to the desired output file name. output_to_file_via_temporary filename fn opens a temporary file which is passed to fn (name + output channel). When fn returns, the channel is closed and the temporary file is renamed to filename.
val protect_output_to_file : string -> (Stdlib.out_channel -> 'a) -> 'aOpen the given filename for binary writing, pass the out_channel to the given function, then close the channel. If the function raises an exception, then filename will be removed and the backtrace is printed; otherwise, the file name is recorded, and the file can still be retroactively removed by remove_successful_output_files.
Remove all successful writes done by protect_output_to_file.
Create a temporary directory with a random number in the name.
concat_null_terminated [x1;x2; ... xn] is x1 ^ "\000" ^ x2 ^ "\000" ^ ... ^ xn ^ "\000"
split_null_terminated s is similar String.split_on_char '\000' but ignores the trailing separator, if any
Return the given file name without its extensions. The extensions is the longest suffix starting with a period and not including a directory separator, .xyz.uvw for instance.
Return the given name if it does not contain an extension.
Integer operations
log2_nativeint n computes floor (log2 n) when n > 0 . If n is also a power of 2, the result s satisfies n = Nativeint.shift_left 1n s
no_overflow_add n1 n2 returns true if the computation of n1 + n2 does not overflow.
no_overflow_sub n1 n2 returns true if the computation of n1 - n2 does not overflow.
no_overflow_mul n1 n2 returns true if the computation of n1 * n2 does not overflow.
no_overflow_lsl n k returns true if the computation of n lsl k does not overflow.
module Int_literal_converter : sig ... endfind_first_mono p takes an integer predicate p : int -> bool that we assume: 1. is monotonic on natural numbers: if a <= b then p a implies p b, 2. is satisfied for some natural numbers in range 0; max_int (this is equivalent to: p max_int = true).
find_first_mono p is the smallest natural number N that satisfies p, computed in O(log(N)) calls to p.
Our implementation supports two cases where the preconditions on p are not respected:
- If
pis alwaysfalse, we silently returnmax_intinstead of looping or crashing. - If
pis non-monotonic but eventually true, we return some satisfying value.
String operations
search_substring pat str start returns the position of the first occurrence of string pat in string str. Search starts at offset start in str. Raise Not_found if pat does not occur.
replace_substring ~before ~after str replaces all occurrences of before with after in str and returns the resulting string.
rev_split_words s splits s in blank-separated words, and returns the list of words in reverse order.
String.cut_at s c returns a pair containing the sub-string before the first occurrence of c in s, and the sub-string after the first occurrence of c in s. let (before, after) = String.cut_at s c in before ^ String.make 1 c ^ after is the identity if s contains c.
Raise Not_found if the character does not appear in the string
ordinal_suffix n is the appropriate suffix to append to the numeral n as an ordinal number: 1 -> "st", 2 -> "nd", 3 -> "rd", 4 -> "th", and so on. Handles larger numbers (e.g., 42 -> "nd") and the numbers 11--13 (which all get "th") correctly.
format_as_unboxed_literal constant_literal converts constant_literal to its corresponding unboxed literal by either adding "#" in front or changing "-" to "-#".
Examples:
0.1 to #0.1 -3 to -#3 0xa.cp-1 to #0xa.cp-1
normalise_eol s returns a fresh copy of s with any '\r' characters removed. Intended for pre-processing text which will subsequently be printed on a channel which performs EOL transformations (i.e. Windows)
delete_eol_spaces s returns a fresh copy of s with any end of line spaces removed. Intended to normalize the output of the toplevel for tests.
Operations on references
val protect_refs : Misc.ref_and_value list -> (unit -> 'a) -> 'aprotect_refs l f temporarily sets r to v for each R (r, v) in l while executing f. The previous contents of the references is restored even if f raises an exception, without altering the exception backtrace.
val get_ref : 'a list Stdlib.ref -> 'a listget_ref lr returns the content of the list reference lr and reset its content to the empty list.
val set_or_ignore : ('a -> 'b option) -> 'b option Stdlib.ref -> 'a -> unitset_or_ignore f opt x sets opt to f x if it returns Some _, or leaves it unmodified if it returns None.
Operations on triples and quadruples
Long strings
``Long strings'' are mutable arrays of characters that are not limited in length to Sys.max_string_length.
module LongString : sig ... endSpell checking and ``did you mean'' suggestions
edit_distance a b cutoff computes the edit distance between strings a and b. To help efficiency, it uses a cutoff: if the distance d is smaller than cutoff, it returns Some d, else None.
The distance algorithm currently used is Damerau-Levenshtein: it computes the number of insertion, deletion, substitution of letters, or swapping of adjacent letters to go from one word to the other. The particular algorithm may change in the future.
spellcheck env name takes a list of names env that exist in the current environment and an erroneous name, and returns a list of suggestions taken from env, that are close enough to name that it may be a typo for one of them.
val did_you_mean : Stdlib.Format.formatter -> (unit -> string list) -> unitdid_you_mean ppf get_choices hints that the user may have meant one of the option returned by calling get_choices. It does nothing if the returned list is empty.
The unit -> ... thunking is meant to delay any potentially-slow computation (typically computing edit-distance with many things from the current environment) to when the hint message is to be printed. You should print an understandable error message before calling did_you_mean, so that users get a clear notification of the failure even if producing the hint is slow.
module Color : sig ... endStyling handling for terminal output
module Style : sig ... endmodule Error_style : sig ... endFormatted output
val print_if :
Stdlib.Format.formatter ->
bool Stdlib.ref ->
(Stdlib.Format.formatter -> 'a -> unit) ->
'a ->
'aprint_if ppf flag fmt x prints x with fmt on ppf if flag is true.
val pp_two_columns :
?sep:string ->
?max_lines:int ->
Stdlib.Format.formatter ->
(string * string) list ->
unitpp_two_columns ?sep ?max_lines ppf l prints the lines in l as two columns separated by sep ("|" by default). max_lines can be used to indicate a maximum number of lines to print -- an ellipsis gets inserted at the middle if the input has too many lines.
Example:
pp_two_columns ~max_lines:3 Format.std_formatter [ "abc", "hello"; "def", "zzz"; "a" , "bllbl"; "bb" , "dddddd"; ]
prints
abc | hello ... bb | dddddd
val pp_table : Stdlib.Format.formatter -> (string * string list) list -> unitpp_table ppf l prints the table l, a list of columns with their header. The function fails with a fatal error if the columns have different length.
val pp_parens_if :
bool ->
(Stdlib.Format.formatter -> 'a -> unit) ->
Stdlib.Format.formatter ->
'a ->
unitpp_parens_if bool formatter ppf arg prints formatter ppf arg, wrapping it with () if bool is true.
val pp_nested_list :
nested:bool ->
pp_element:(nested:bool -> Stdlib.Format.formatter -> 'a -> unit) ->
pp_sep:(Stdlib.Format.formatter -> unit -> unit) ->
Stdlib.Format.formatter ->
'a list ->
unitpp_nested_list ~nested ~pp_element ~pp_sep ppf args prints the list args with pp_element on ppf. The elements are separated by pp_sep. If ~nested is true, the list is wrapped in parens. The element printer is always called with nested:true, indicating that any inner lists are nested and need parens.
val print_see_manual : Stdlib.Format.formatter -> int list -> unitSee manual section
val output_of_print :
(Stdlib.Format.formatter -> 'a -> unit) ->
Stdlib.out_channel ->
'a ->
unitoutput_of_print print produces an output function from a pretty printer. Note that naively using Format.formatter_of_out_channel typechecks but doesn't work because it fails to flush the formatter.
val is_print_longer_than : int -> (Stdlib.Format.formatter -> unit) -> boolReturns true if the printed string is longer than the given integer. Stops early if so. Spaces and newlines are counted, but indentation is not.
val to_string_of_print :
(Stdlib.Format.formatter -> 'a -> unit) ->
'a ->
stringto_string_of_print print produces a string conversion function from a pretty printer. This is similar but preferable to Format.asprintf "%a" when the output may be large, since to_string functions don't usually return embedded newlines.
Displaying configuration variables
Display the values of all compiler configuration variables from module Config, then exit the program with code 0.
Display the value of the given configuration variable, then exit the program with code 0.
Handling of build maps
Build maps cause the compiler to normalize file names embedded in object files, thus leading to more reproducible builds.
val get_build_path_prefix_map : unit -> Build_path_prefix_map.map optionReturns the map encoded in the BUILD_PATH_PREFIX_MAP environment variable.
Returns the list of --debug-prefix-map flags to be passed to the assembler, built from the BUILD_PATH_PREFIX_MAP environment variable.
module Bitmap : sig ... endHandling of magic numbers
module Magic_number : sig ... enda typical magic number is "Caml1999I011"; it is formed of an alphanumeric prefix, here Caml1990I, followed by a version, here 011. The prefix identifies the kind of the versioned data: here the I indicates that it is the magic number for .cmi files.
module Le_result : sig ... endThe result of a less-than-or-equal comparison
Propositional equality
module type T = sig ... endUtilities for module-level programming
module type T1 = sig ... endmodule type T2 = sig ... endmodule type T3 = sig ... endmodule type T4 = sig ... endMiscellaneous type aliases
type alerts = string Misc.Stdlib.String.Map.tmodule Json : sig ... endmodule Nonempty_list : sig ... endNon-empty lists
module Maybe_bounded : sig ... endA bounded non-negative integer. The possible ranges are 0 ..< n, represented by Bounded { bound = n} and 0 ..< ∞ represented by Unbounded.